WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuds 
serotype Gal 1 inarum 
5 (B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 5 0 

10 CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 2 50 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

15 GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 3 50 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACT CAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 50 0 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 55 0 

20 TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 70 0 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 75 0 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 80 0 

25 CGACGGTCTG CG 812 



30 



45 



2) INFORMATION FOR SEQ ID NO: 733 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuii 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 



TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200 

50 GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 3 00 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 4 00 

CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 450 

55 CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

60 CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

418 
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GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 80tf 
GACGGTCTGC GTTT 814 



2) INFORMATION FOR SEQ ID NO: 734 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 
TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 
GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 
CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 
AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 
CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 
CGTATTCTCC ATCTC CGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 
GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 
ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTC CGCA AACTGCTGGA 
CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 
CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 
CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 
GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 
GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 
CGCAATGGAC GACGGTCTGC GTTTCGCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



2) INFORMATION FOR SEQ ID NO: 735 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratla grimesii 

(B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 
TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100 

419 
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TCATGAACAA ATGCGACATG GTTGATGATG 
GAAATGGAAG TTCGTGAACT TCTGTCTGCT 
CCTGCCAGTT GTTCGTGGTT CAGCGCTGAA 
AGTGGGAAGC TAAAATCATC GAACTGGCTG 
5 CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGYC GTGGTACYGT 
GCGGTATCGT TAAAGTTGGC GAAGAAGTTG 
ACCGTTAAGT CTACCTGTAC TGGCGTTGAA 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT 

10 GTGAAGACAT CGAACGTGGT CAGGTACTGG 
CCACACACCA AATTCGACTC AGAAGTTTAC 
TGGTCGTCAC ACTCCATTCT TCAAAGGCTA 
GTACAACTGA CGTGACCGGT ACCATCGAAC 
GTAATGCCAG GCGATAACGT GAACATGGTT 

15 CGCGATGGAC GACGGTCTGC GTTTC 



PCT/CA00/01150 

AAGAGCTGCT GGAACTGGTA 150, 

TATGACTTCC CTGGTGATGA 2 00 

AGCACTGGAA GGCGAAGCTG 250 

GCTACCTGGA TTCTTACATC 300 

TTCCTGCTGC CAATCGAAGA 350 

AGTTACCGGT CGTGTAGAGC 400 

AAATCGTTGG TATCAAAGAC 450 

ATGTTCCGCA AACTGCTGGA 50 0 

TCTGCTGCGT GGTATCAAGC 550 

CTAAACCAGG TTCAATCAAG 60 0 

ATCCTGAGCA AAGAAGAAGG 650 

CCGTCCACAG TTCTACTTCC 70 0 

TGCCAGAAGG CGTAGAGATG 750 

GTAACCCTGA TTCACCCAAT 800 

825 



20 



2) INFORMATION FOR SEQ ID NO: 73 6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736 

35 TATTAGTTTG TTCAGCAACA GATGGACCAA TGCCACAAAC AAGAGAGCAT 50 

ATACTATTAT CAAGACAAGT TGGAGTACCA TATATAGTAG TATTCTTAAA 100 

CAAATGTGAC ATGGTAGATG ATGAAGAGTT ATTAGAGTTA GTAGAGATGG 150 

AAGTAAGAGA TTTATTAACA GAATATGATT TCCCAGGAGA TGACACTCCA 200 

ATAGTAAGAG GTTCAGCATT AATGGCATTA GAAGATCCAA AGAGYGAGTG 250 

40 GGGAGATAAG ATAGTAGAAT TATTCGAGCA AATAGATGAG TATATACCAG 300 

CTCCAGAGAG AGATACAGAT AAACCATTCT TAATGCCAGT AGAGGACGTA 350 

TTCTCAATCA CAGGAAGAGG AACAGTTGCA ACAGGAAGAG TGGAAAGAGG 400 

AGTACTAAAA GTACAAGACG AAGTAGAKTT AGTAGGATTA ACAGAAGCAC 450 

CAAGAAAAGT AGTAGTAACA GGAGTAGAGA TGTTCAGAAA ATTATTAGAC 500 

45 CAAGCACAAG CAGGGGATAA TATAGGAGCA TTATTAAGAG GAGTACAAAG 550 

AAACGAGATA GAAAGAGGAC AAGTACTAGC AAAGACTGGA TCAGTAAAGG 600 

CACACACAAA GTTTACAGCA GAAGTATATG TACTTAAAAA AGAAGARGGT 650 

GGAAGACATA CACCATTCTT TGATGGATAT AGACCACAAT TCTATTTCAG 700 

AACAACAGAC GTAACAGGAG CTTGTAAGTT ACCAGAAGGA ATAGAGATGG 750 

50 TAATGCCTGG AGATAACGTA ACAATGGAAG TAGACTTAAT AAACTCAA 798 



2) INFORMATION FOR SEQ ID NO: 737 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

420 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 7 

10 GTCAACATGA TGGAGCTCAT CAACAACATC GCGAAGGAGC ACGGCGGTTA 50 

CTCCGTGTTC GCGGGCGTGG GCGAGCGTAC CCGTGAAGGG AACGACTTCT 100 

ACCACGAAAT GAAGGACTCG AACGTTCTCG ACAAGGTCGC GCTGGTGTAC 150 

GGCCAGATGA ACGAGCCGCC GGGCAACCGT CTGCGCGTGG CGCTGACGGG 2 00 

CCTCACGATG GCCGAGCACT TCCGTGACGA AGGCCTCGAC GTGCTGTTCT 250 

15 TCGTCGACAA CATCTACCGT TTCACGCTGG CCGGTACCGA AGTGTCGGCG 3 00 

CTGCTCGGCC GTATGCCGTC GGCAGTGGGC TATCAGCCGA CGCTGGCTGA 350 

AGAAATGGGC AAGCTGCAAG AGCGCATCAC GTCGACGAAG AAGGGCTCGA 4 00 

TCACGTCGGT T 411 



20 



35 



2) INFORMATION FOR SEQ ID NO: 73 8 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifejrmentans 

(B) STRAIN: ATCC 63 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738 



TACAAGAGCT TATTAACAAT ATAGCTACTC AACACGGTGG TATATCAGTA 50 

TTCGCAGGTG TTGGAGAGAG AACAAGAGAA GGTAACGACT TATTCCATGA 100 

40 GATGAGCGAT ACAGGAGTTA TAAATAAAAC AGCTCTAGTA TTCGGACAAA 150 

TGAATGAGCC ACCTGGAGCA AGAATGAGAG TTGCTTTAAC TGGTCTTACA 2 00 

ATGGCTGAAT ACTTCAGAGA TCAACAAGGG CAAGACGTTT TATTATTCGT 250 

AGATAATATA TTCCGTTTCA CTCAAGCAGG ATCTGAGGTT TCTGCACTTC 300 

TTGGACGTAC TCCATCAGCA GTTGGATACC AACCAACATT AGCAACAGAG 350 

45 ATGGGTAGAT TACAAGAGAG AATAACATCT ACAAATAAAG GGTC 394 



2 ) INFORMATION FOR SEQ ID NO: 739 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

421 
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(A) ORGANISM: Clostridium beijerincki (deposited as" 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 

TTAATAAACA ACATAGCTAA ACAACATGGT GGTTTATCAG TATTTACTGG 5 0 

AGTTGGTGAA AGATCAAGAG AAGGTAATGA CTTATATCAT GAAATGAGAG 100 

AGTCAGGAGT TATTGATAAG ACAGCATTAG TATTTGGACA AATGAATGAG 150 

10 CCACCGGGTG CCAGAATGAG AGTTGCATTA ACAGGTCTTA CTATGGCAGA 2 00 

GTATTTTAGA GATAAAGGTC AAGATGTGTT ACTATTCATA GATAACATAT 2 50 

T CAG AT AT AC TCAAGCAGGT TCAGAGGTTT CAGCATTACT TGGAAGAACA 300 

CCTTCAGCGG TTGGATATCA GCCAACACTT GCAACTGAAA TGGGTGCACT 350 

TCAGGAAAGA ATTACATCAA CAGTTAATGG TTCTATTACG TCAG 394 

15. 



2 ) INFORMATION FOR SEQ ID NO: 74 0 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 
30 (B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGCTAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50 

35 GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100 

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150 

CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 200 

• CATTTTAGAG ATGAGCAAGG ACAAGACGTT TTACTTTTCG TTGATAATAT 2 50 

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 300 

40 TGCCATCAGC TGTTGGTTAT CAGCCAACAT TAGCTACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 393 



45 2) INFORMATION FOR SEQ ID NO: 741 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 98 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 

(B) STRAIN : ATCC 25582 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

422 
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TTGATTCAAG AATTCATTAA 
AGTTTTTGCT GGAGTTGGTG 
ATGAAATGAA GGAAAGTGGT 
CAGATGAATG AACCCCCAGG 
TACTATGGCA GAAGAATTCC 
TCATCGATAA TATTTTCCGT 
TTACTTGGAC GGGTACCATC 
CGAAATGGGT GCTTTACAAG 



TAACATTGCT 
AACGTAGCCG 
GTTTTATCTA 
AGCTCGTTTA 
GTGATGAACA 
TTTACTCAAG 
ACAAGCTGGG 
AACGGATTAC 



ACAGAACATG GTGGTTTATC 
TGAAGGTAAT GATTTATATT 
AAACAACACT AGTATTTGGA 
AGAGTTGCTT TAACGGGTCT 
AGGTCAGGAT GTCTTATTAT 
CTGGATCTGA AGTATCTGCC 
TATCAGCCAA CTTTAGCAAC 
ATCAACTAAA AAAGGATC 



50 
100 
150 
200 
250 
300 
350 
398 



2) INFORMATION FOR SEQ ID NO: 742 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 80 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12 964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 

TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 2 00 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 250 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 3 00 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 350 

ACTTCAACTA AAAATGGATC AATAACTTCA 3 80 



2) INFORMATION FOR SEQ ID NO: 743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 

TTAATAAATA ATATAGCAAA AGAGCATGGT GGTCTTTCTG TATTTACAGG 50 

AGTTGGAGAA AGGTCAAGAG AAGGTAACGA CTTATATTAT GAAATGAAAG 100 

AGTCAGGGGT TATAGATAAG ACAGCTTTAG TATTTGGACA AATGAATGAA 150 

TGACCAGGAG CAAGAATGAG AGTTTCATTA ACTGGATTAA CTATGGCTGA 2 00 

ATATTTTAGA GATCAAGGTC AAGACGTTCT TTTATTTATA GATAATATAT 250 

TTAGATTTAC TCAAGCGGGA TCAGAAGTTT CTGCGTTATT AGGAAGAATT 3 00 



423 
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CCTTCAGCAG TTGGATATCA ACCAACTCTT GCAACTGAAA TGGGAGCACT 35t) 
TCAAGAAAGA ATAACATCAA CAAAGAATGG ATCAATCAC 389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

25 CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 200 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 250 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 300 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 3 50 
TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT - 400 

30 GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

35 CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 750 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800 

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 



40 



2) INFORMATION FOR SEQ ID NO: 745 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 
55 (B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 
60 TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 

424 
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TCGCCATGGG TTCTTCTGAT GGTCTGCGCC GCGGTCTGGA TGTAAAAGAC 450 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200 

CATGAACGTA CTGGGTCAAC CGGTTGACAT GAAAGGCGAC ATCGGCGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CGTCCTATGA AGAGCTGTCC 3 00 

5 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT GTTCGGCGGT GCGGGTGTAG 4 00 

GTAAAACTGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

10 TGGTGTACGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700 

TATCCGCGCT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 80 0 

15 CGGTTCTATC ACCTCCGTA 8 3. 9 



20 



2) INFORMATION FOR SEQ ID NO: 746 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Neisseria canis 

(B) STRAIN: ATCC 14687 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 74 6 

3 5 GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 50 

CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTACTAT 100 

TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 150 

CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGCCGTATT 200 

ATGGACGTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 250 

40 GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 3 00 

GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 350 

CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 4 00 

CAAAACCGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 450 

GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 500 

45 GACTTTTATC ATGAGATGAA AGATTCCAAT GTATTAGATA AAGTAGCCAT 550 
GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC . 600 

TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 650 

GGTAAAGGCC GTGATGTATT GTTCTTTGTA GACAATATTT ACCGCTATAC 700 

CCTAGCCGGT ACAGAAGTAT CTGCATTGCT TGGCCGTATG CCTTCGGCAG 750 

50 TAGGTTATCA GCCGACGTTG GCAGAGGAAA TGGGCCGCTT GCAAGAGCGT 800 

ATTACYTCAM CCCAAACAGG CTCT 824 



55 2) INFORMATION FOR SEQ ID NO: 747 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

425 
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TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 7 



CGCGACGCTA 
TCTGACTCTG 
TTGCAATGGG 
ACTGGTGCGC 

15 TGTCGACGTA 
ACAAAAGCCG 
TCAGCTACCG 
TCCGTTTGCT 
GCAAAACCGT 

2 0 AGCGGTCTGT 
CGACTTCTAC 
TGGTGTATGG 
TTGAC CGGTT 
CGGTAAAGGC 

25 CTTTGGCCGG 
GTGGGTTACC 
TATTACCTCT 



TCCCGCATGT 
GAGGTTCAAC 
TAGTTCAGAC 
CAATCACTGT 
TTGGGTACGC 
TGCCATTCAC 
AATTGTTGGA 
AAAGGCGGTA 
GAACATGATG 
CCGTGTTCGC 
CACGAGATGA 
C C AAATGAAC 
TGACTATGGC 
CGCGACGTAT 
TACTGAAGTA 
AACCGACATT 
ACCCAAACCG 



TTACGATGCC 
AGCTTCTGGG 
GGCCTTAAAC 
GCCGGTAGGT 
CTGTTGATGA 
CAAACTGCTC 
AACCGGTATT 
AAGTAGGTCT 
GAATTGATCA 
AGGTGTGGGT 
AAGATTCCAA 
GAACCTCCGG 
CGAATACTTC 
TGTTCTTCGT 
TCTGCACTGT 
GGCTGAAGAA 
GTTCCATTAC 



CTGAAATTGG 
CGACGGCGTT 
GCGGTATGTC 
AAAGGTACAT 
AGCAGGTCCG 
CGAAATTCGA 
AAAGTGATCG 
GTTCGGTGGT 
ACAACATCGC 
GAGCGTACCC 
CGTATTGGAT 
GCAACCGTCT 
CGTGACGAAA 
TGACAACATC 
TGGGCCGTAT 
ATGGGTCGTT 
T 



ACGAGAACGG 50 

GTCCGTACTA 100 

TGTAAGCAAT 150 

TGGGTCGTAT 2 00 

ATCGATACCG 2 50 

CGAGTTGTCT 300 

ACTTGCTGTG 350 

GCCGGTGTGG 4 00 

CAAAGCGCAC 450 

GTGAAGGTAA 500 

AAAGTAGCCA 550 

GCGCGTTGCT 6 00 

AAGACGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 750 

TGCAAGAGCG 8 00 

831 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 862 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14 688 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 

CCGTGGCCAA GTACCACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

50 CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 200 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250 

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 3 00 

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 3 50 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 400 

55 GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 450 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 500 

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

60 TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 700 
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TTCGTTGATA ACATCTATCG 
ACTTCTAGGT CGTATGCCAT 
AAGAAATGGG TGCGTTGCAA 
ATCACTTCGG GG 



TTACACACTG GCTGGTACGG 
CAGCAGTAGG TTATCAACCG 
GAGCGTATTA CCTCAACGCA 



AAGTGTCAGC 7 SO'' 

ACTCTGGCTG 800 
ATCGGGTTCG 850 

862 



2 ) INFORMATION FOR SEQ ID NO: 749 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 

GGAACTCCCA CGTGACGCTA TCCCGCATGT TTTTGATGCA TTAAAATTAG 50 

TTGAAAATGA CCTAACCTTA GAAGTTCAAC AACTTTTGGG GGATGGTGTA 100 

GTGCGTACCA TTGCGATGGG TAGTTCAGAT GGATTAAAGC GTGGTATGGC 150 

TGTGAATAAT ACCGGAGCTC CGATTACTGT TCCTGTTGGC CGTGAAACTT 200 

TGGGTCGTAT CATGGATGTA TTGGGTAATC CGGTTGATGA GGCAGGTCCG 250 

GTAAATGCAT CCAATACACG TGCGATCCAT CAAGAGGCTC CTAAGTTTGA 300 

TGAGCTTTCT TCAACAACGG AATTATTAGA AACTGGCATT AAGGTTATCG 350 

ACTTGTTATG TCCGTTTGCC AAAGGTGGTA AAGTAGGTCT GTTTGGTGGT 400 

GCGGGTGTAG GTAAAACCGT AAATATGATG GAGTTAATTA ACAACATTGC 450 

CAAGGCACAT AGCGGTTTGT CTGTGTTTGC AGGCGTGGGT GAACGTACTC 500 

GTGAAGGTAA TGACTTCTAT CACGAGATGA AAGATTCCAA CGTATTGGAC 550 

AAAGTGGCAA TGGTTTACGG TCAGATGAAC GAACCTCCAG GCAACCGTCT 600 

GCGCGTTGCT TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA 650 

AAGACGAAAA CGGTAAAGGT CGCGACGTAT TGTTCTTCGT GGACAACATT 700 

TACCGTTACA CTTTGGCCGG TACGGAAGTA TCCGCATTGC TGGGTCGTAT 750 

GCCTTCAGCA GTAGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT 800 

TGCAAGAGCG TATTACCTCT ACC CAGACAG GCTCTATTAC TTCC 844 



2) INFORMATION FOR SEQ ID NO: 750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 

CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 
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TCTGACTCTG GAAGTTCAAC 
TTGCAATGGG TAGTTCAGAC 
ACCGGTGCGC CAATCACTGT 
TGTCGACGTA TTGGGTACGC 
5 ACAAGAGCCG TGCCATTCAC 
TCAACTACCG AATTGTTGGA 
TCCGTTTGCT AAGGGCGGTA 
GCAAAACCGT GAACATGATG 
AGCGGCCTGT CCGTGTTCGC 

10 CGACTTCTAC CACGAGATGA 
TGGTGTATGG TCAAATGAAC 
TTGACCGGTT TGACTATGGC 
CGGTAAAGGT CGCGACGTAT 
CTCTGGCCGG TACCGAAGTA 

15 GTGGGTTACC AACCGACATT 
TATTACTTCT ACGCAAACCG 



AGCTTCTGGG TGACGGCGTT 
GGCCTGAAAC GCGGCATGTC 
GCCGGTAGGT AAAGGTACAT 
CTGTTGATGA AGCAGGTCCG 
CAAACTGCTC CGAAATTCGA 
AACCGGTATT AAAGTGATCG 
AAGTAGGTCT GTTCGGTGGT 
GAATTGATCA ACAACATCGC 
AGGTGTGGGT GAACGTACCC 
AAGATTCCAA CGTATTGGAT 
GAACCTCCAG GCAACCGTCT 
CGAATACTTC CGTGACGAAA 
TGTTCTTCGT TGACAACATC 
TCCGCACTGT TGGGTCGTAT 
GGCTGAAGAA ATGGGTCGTT 
GTTCCATTAC TTCC 



gtccgt£cta ioo- 

tgtaagcaat 150 

tgggccgtat 200 

atcgataccg 250 

tgagttgtct 300 

ACTTGCTGTG 350 

GCCGGTGTGG 4 00 

CAAAGCGCAC 450 

GTGAAGGTAA 5 00 

AAAGTAGCCA 550 
GCGCGTTGCT * 600 

AAGATGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 75 0 

TGCAGGAGCG 8 00 

834 



2 0 2) INFORMATION FOR SEQ ID NO: 751 



25 



30 



35 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear „ 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 31426 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 



CGCGACATGA 
TCTGACTTTG 
TCGCTATGGG 
ACTGGTTCGC 
TGTCGATGTA 
ACAAGAGTCG 
TCCACAACCG 
TCCGTTTGCC 
GTAAAACCGT 
AGCGGCTTGT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTT 
CGGTAAAGGT 
CTCTGGCCGG 
GTGGGTTACC 
TATTACCTCT 



TTCCGCGCGT 
GAAGTCCAAC 
CAGCTCGGAC 
CCATTACTGT 
TTGGGAACTC 
TGCCATCCAC 
AATTGCTCGA 
AAAGGCGGTA 
GAACATGATG 
CCGTGTTCTC 
CACGAGATGA 
CCAAATGAAC 
TGACTATGGC 
CGTGACGTAT 
TACCGAAGTA 
AACCGACATT 
ACCCAAACCG 



TTACGACGCT 
AGCTTTTGGG 
GGTTTGAAAC 
GCCGGTAGGT 
CTGTTGACGA 
CAAGCCGCTC 
AACGGGCATT 
AAGTAGGTCT 
GAATTGATCA 
AGGCGTGGGT 
AAGATTCCAA 
GAACCTCCAG 
TGAATACTTC 
TGTTCTTCGT 
TCCGCACTGT 
GGCTGAAGAA 
GTTCCATTAC 



TTGAAATTAG 
TGATGGCGTA 
GCGGCATGAC 
AAAGGTACGT 
GGCAGGTCCA 
CTAAGTTTGA 
AAAGTGATTG 
GTTCGGCGGT 
ACAACATCGC 
GAGCGTACCC 
CGTATTGGAT 
GCAACCGTCT 
CGTGACGAAA 
TGACAACATC 
TGGGCCGTAT 
ATGGGTCGTT 
TTCC 



ACGAAAACGG 50 

GTCCGTACCA 100 

TGTGAGCAAT 150 

TGGGACGCAT 200 

ATTGATACCG 250 

CGAACTGTCT 300 

ACTTGCTGTG 3 50 

GCCGGTGTGG 4 00 

CAAAGCGCAC 450 

GCGAAGGTAA 500 

AAAGTAGCCA 550 

GCGCGTTGCT 600 

AAGACGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 750 

TGCAAGAGCG 800 

834 



55 

2) INFORMATION FOR SEQ ID NO: 752 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
60 (B> TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria, gonorrhoeae 

(B) STRAIN: ATCC 27628 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 

ACATGATTCC GCGCGTTTAC GACGCTTTGA AATTAGACGA AAACGGTCTG 50 

ACTTTGGAAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

15 GTTCGCCCAT TACTGTGCCG GTAGGTAAAG GTACGTTGGG ACGCATTGTC 2 00 

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 3 00 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 350 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 4 00 

2 0 AACCGTGAAC ATGATGGAAT TGATCAACAA CATCGCCAAA GCGCACAGCG 45 0 

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

TTCTACCACG AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 60 0 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

25 AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 700 

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 750 

GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 825 



30 



45 



2 ) INFORMATION FOR SEQ ID NO: 753 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, lactamica 

(B) STRAIN: ATCC 23970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

50 CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150 

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 2 00 

CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 300 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 350 

55 GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400 

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450 

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

60 ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 
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CAAAGGCCGC GACGTATTGT TCTTCGTGGA 

TGGCCGGTAC CGAAGTATCC GCACTGTTGG 

GGTTACCAAC CGACATTGGC TGAAGAAATG 

TACCTCTACC CAAACCGGTT CCATTACTTC 



CAACATCTAC CGTTACACCC 7O0 

GCCGTATGCC TTCCGCAGTG 750 

GGTCGTTTGC AAGAGCGTAT 800 

C 831 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 754 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 



CCACGCGACA 

25 CGGTCTGACT 
CCATTGCGAT 
AATACCGGTG 
CATTGTCGAT 
CCGACAAGAG 

30 TCTTCCACAA 
GTGTCCGTTT 
TGGGTAAAAC 
CACAGCGGCT 
TAACGACTTC 

35 CAATGGTTTA 
GCATTGACCG 
AAACGGCAAA 
ACACTCTAGC 
GCAGTGGGCT 

40 GCGTATTACC 



TGATTCCGCG 
TTGGAAGTCC 
GGGCAGCTCG 
CGCCCATTAC 
GTATTGGGAA 
CCGTGCCATC 
CCGAATTGCT 
GCCAAAGGCG 
CGTGAACATG 
TGTCCGTGTT 
TACCACGAGA 
CGGTCAGATG 
GTTTGACCAT 
GGCCGCGACG 
TGGTACCGAA 
ACCAACCGAC 
TCTACCCAAA 



CGTTTACGAC 
AACAGCTTTT 
GACGGTTTGA 
TGTGCCGGTA 
CGCCTGTTGA 
CACCAAGCCG 
CGAAACGGGC 
GTAAAGTAGG 
ATGGAATTGA 
CGCAGGCGTG 
TGAAAGATTC 
AACGAACCTC 
GGCGGAATAC 
TATTGTTCTT 
GTATCCGCAT 
ATTGGCAGAA 
CCGGTTCTAT 



GCTTTGAAAT 
GGGCGACGGC 
AACGCGGCAT 
GGTAAAGGTA 
CGAGGCAGGT 
CTCCTAAGTT 
ATTAAAGTGA 
TCTGTTCGGC 
TCAACAACAT 
GGTGAGCGTA 
CAACGTATTG 
CGGGCAACCG 
TTCCGTGACG 
CGTGGACAAC 
TGTTGGGCCG 
GAAATGGGTC 
TACTTC 



TAGACGAAAA 50 

GTAGTCCGTA 100 

GACTGTGAGC 150 

CGTTGGGACG 200 

CCAATCGATA 2 50 

TGACGAACTG 3 00 

TTGACTTGCT 350 

GGTGCCGGTG 4 00 

CGCCAAAGCG 450 

CCCGCGAAGG 500 

GATAAAGTGG 550 

TTTGCGCGTC 600 

AAAAAGACGA 650 

ATCTACCGTT 700 

TATGCCGTCT 750 

GTTTGCAGGA 800 

836 



45 



2) INFORMATION FOR SEQ ID NO: 755 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 

60 CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 
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CGGCCTGACT TTGGAAGTGC AACAGCTTCT 

CTATTGCAAT GGGTAGTTCG GATGGTTTGA 

AATACAGGTG CGCCGATTAC AGTACCGGTA 

TATTGTCGAT GTATTGGGTA CGCCTGTTGA 

5 CCGACAAACA CCGTGCCATC CATCAGACAG 

TCTGCGACTA CTGAGCTGCT GGAAACCGGC 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG 

TAGGCAAAAC CGTCAACATG ATGGAATTAA 

CATAGCGGTT TGTCCGTGTT TGCAGGTGTG 

10 TAATGACTTC TACCACGAGA TGAAAGATTC 

CGATGGTTTA CGGTCAGATG AACGAACCTC 

GCCTTGACCG GTTTGACGAT GGCCGAATAC 

AAGCGGCAAA GGCCGTGACG TATTGTTCTT 

ACACCCTGGC CGGTACCGAA GTATCCGCAT 

15 GCAGTAGGTT ACCAACCGAC ATTGGCTGAA 

GCGTATTACC TCTACCCAAA CAGGCTCCAT 



PCT/CA00/01150 

GGGTGACGGC GTAGTTCGTA ^00 

AACGCGGCAT GACTGTAAGC 150 

GGTAAAGGTA CTTTGGGACG 2 00 

CGAAGCAGGT CCGATTGATA 250 

CTCCGAAATT TGATGAACTG 3 00 

ATTAAAGTGA TTGACTTGCT 350 

TCTGTTCGGT GGTGCCGGTG 400 

TTAACAACAT CGCCAAAGCA 450 

GGTGAGCGTA CCCGTGAAGG 500 

CAA.CGTATTG GACAAAGTGG 550 

CGGGTAACCG TCTGCGTGTA 600 

TTCCGTGATG AAAAAGACGA 650 

CGTGGACAAT ATCTACCGTT 7 00 

TGTTGGGTCG TATGCCTTCA 750 

GAAATGGGTC GTTTGCAAGA 8 00 

TACCTCC 837 



20 2) INFORMATION FOR SEQ ID NO: 756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 2 00 

40 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 2 50 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 3 00 

TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 4 00 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 4 50 

45 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TAC CGTTAC A 7 00 

50 CTCTGGCCGG TACCGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 8 00 

TATTACCTCT ACCCAAACTG GTTCCATTAC TTCC 834 



55 

2) INFORMATION FOR SEQ ID NO: 757 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 



CCCGTGATGC 
ATCTGACCCT 
ATTGCAATGG 

15 TACCGGCGCT 
TTATGGATGT 
GAAGAAACTC 
TTCAGCAACT 
GCCCGTTTGC 

20 GGTAAAACCG 
TAGTGGTTTG 
ACGACTTCTA 
ATGGTTTATG 
TTTGACTGGT 

25 ACGGCAAAGG 
ACTCTGGCCG 
AGTAGGTTAT 
GTATTACTTC 



TATTC CCATG 
GGAAGTGCAA 
GTAGTTCAGA 
CCGATTACTG 
GTTGGGTAAT 
GCGCTATTCA 
GAGTTGTTGG 
CAAAGGTGGT 
TAAATATGAT 
TCTGTATTCG 
CCATGAAATG 
GCCAGATGAA 
TTGACTATGG 
TCGTGACGTC 
GTACTGAAGT 
CAGCCTACAT 
TACTCAAACA 



TATACGATGC 
CAACTTTTAG 
CGGCCTAAAA 
TTCCGGTGGG 
CCGGTTGATG 
TCAAGCTGCC 
AAACAGGCAT 
AAAGTAGGTT 
GGAGTTGATC 
CCGGTGTAGG 
AAAGACTCTA 
TGAACCTCCG 
CCGAATATTT 
TTGTTCTTTG 
GTCTGCACTG 
TGGCAGAAGA 
GGTTCGATTA 



CCTGAAATTG 
GTGATGGTGT 
CGTGGTATGG 
GAAAGCCACC 
AAGCAGGTCC 
CCTAAATTTG 
TAAAGTAATT 
TGTTTGGTGG 
AACAACATCG 
TGAGCGTACC 
ACGTATTGGA 
GGTAAC CGTT 
CCGTGACGAG 
TGGATAATAT 
TTAGGTCGTA 
AATGGGTCGC 
CTT 



GTAGATAACG 50 

GGTTCGTACC 100 

CTGTTAACAA 150 

TTGGGACGTA 2 00 

TGTTGTATCA 250 

ACGAACTGTC 300 

GACTTGCTGT 350 

TGCCGGCGTG 4 00 

CGAAGGCACA 4 50 

CGTGAAGGTA 500 

TAAAGTAGCC 550 

TGCGCGTTGC 600 

AAAGATGAAA 650 

CTATCGCTAT 700 

TGCCGTCTGC 750 

TTGCAGGAGC 800 

833 



30 



35 



40 
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2) INFORMATION FOR SEQ ID NO: 758 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 



CGCGATGCCA 
TCTGACTTTG 

50 TTGCAATGGG 
ACCGGTTCTC 
TATGGACGTA 
AAGAGAAACG 
TCAGCTACCG 

55 TCCGTTTGCA 
GCAAAACCGT 
AGCGGTCTGT 
CGACTTCTAC 
TGGTGTACGG 

60 TTGACCGGCC 



TTCCTCACGT 
GAAGTACAAC 
TAGTTCAGAC 
CGATTGCCGT 
TTGGGCAATC 
TGCTATTCAC 
AGTTGTTGGA 
AAAGGCGGTA 
AAACATGATG 
CTGTGTTTGC 
CACGAGATGA 
TCAGATGAAT 
TGACTATGGC 



TTACGATGCC CTGAAATTGG 
AACTTCTGGG CGACGGTGTG 
GGCCTGAAAC GGGGTTTGTC 
TCCTGTCGGT AAAGCGACTT 
CGGTTGATGA AGCCGGTCCG 
CAAGCCGCAC CGAAGTTTGA 
AACCGGTATT AAAGTAATCG 
AAGTAGGTCT GTTCGGCGGT 
GAATTGATCA ACAACATCGC 
CGGTGTAGGT GAACGTACCC 
AAGATTCCAA CGTGTTGGAT 
GAGCCGCCGG GCAACCGCTT 
CGAATACTTC GTGACGAAAA 
432 



ACGACACCGG 50 

GTACGTACCA 100 

TGTGAGCAAT 150 

TGGGTCGTAT 2 00 

GTTGCTACCG 250 

TGAATTGTCA 3 00 

ACTTGCTGTG 350 

GCCGGTGTGG 4 00 

CAAAGCACAC 450 

GCGAAGGTAA 500 

AAAGTAGCCA 550 

GCGCGTGGCT 600 

AGACGAAAAC 650 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGCAAAGGTC GTGACGTATT 
ACTGGCCGGT AC CGAAGTAT 
TAGGTTATCA GCCGACATTG 
ATTACCTCTA CCCAAACCGG 



GTTCTTCGTG GACAACATTT 

CAGCATTGTT GGGCCGTATG 

GCAGAGGAAA TGGGTCGCTT 

TTCGATTACC TCT 



ACCGCTACAC 7 0 Q/ 

CCGTCTGCAG 750 
GCAAGAGCGT 800 

833 



2) INFORMATION FOR SEQ ID NO: 759 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DMA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 
20 (B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 

AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 50 

25 ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 10 0 

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150 

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 200 

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 250 

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 300 

30 AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT GTTCGGTGGT GCGGGTGTGG 400 

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 550 

35 TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 650 

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCAACA 750 

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 800 

40 AGGTTCAATC ACCTCTGTA 819 



45 



60 



2) INFORMATION FOR SEQ ID NO: 760 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Enteritidis 
(B) STRAIN: ATCC 13076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 

433 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCCGTACCAC GCGTGTACGA 
GCTGGTGCTG GAAGTTCAGC 
TCGCGATGGG GTCTTCTGAC 
CTCGAACACC CGATCGAAGT 
5 CATGAACGTC CTGGGCGAAC 
AAGAGCGTTG GGCGATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCG AAGGGCGGTA 
GTAAAACCGT AAACATGATG 

10 TCCGGTTACT CAGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
CTGACCGGCC TGACCATGGC 
ACTGCTGTTC GTCGATAACA 

15 TATCCGCACT GCTGGGCCGT 
CTGGCGGAAG AGATGGGCGT 
CGGTTCTATC ACCTCCGTA 



TGCCCTTGAG GTGCAGAATG 
AGCAGCTTGG CGGCGGTATC 
GGTCTGCGTC GCGGTCTGGA 
CCCGGTAGGT AAAGCCACGC 
CGGTCGACAT GAAAGGCGAG 
CGCGCAGCAC CTTCCTACGA 
AACCGGTATC AAAGTTATCG 
AAGTCGGTCT GTTCGGTGGT 
GAGCTTATTC GTAACATCGC 
GGGCGTAGGG GAACGTACTC 
CCGACTCCAA CGTTATCGAT 
GAGCCGCCGG GAAACCGTCT 
GGAGAAATTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCTTCCG CAGTAGGTTA 
TCTGCAGGAA CGTATCACCT 



PCT/CA00/01150 

gtaAtgagaa" -£0 

GTGCGTACCA 100 

TGTAAAAGAT 150 

TGGGTCGTAT 200 

ATCGGCGAAG 250 

AGAGTTGTCA 300 

AC CTG ATGTG 350 

GCGGGTGTAG 40 0 

GATCGAGCAC 450 

GTGAGGGTAA 500 

AAAGTATCCC 55 0 

GCGCGTTGCA 600 

GTCGTGACGT 650 

GGTACGGAAG ' 700 

CCAGCCGACT 750 

CCACCAAAAC 800 

819 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761 

ACCAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT CCGATGGTCT GAGC CGTGGG TTGAAAGTAA CCAACCTAGA 150 

40 ACACCCGATC GAAGTAC CGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 250 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 3 00 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 35 0 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

45 ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 60 0 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

50 TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 70 0 

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750 

TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 800 

CAATCACCTC TG 812 



2) INFORMATION FOR SEQ ID NO: 762 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 408 bases 

434 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 



CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

15 CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 200 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 250 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 300 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 350 

2 0 ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 400 

ATGTTGAC 408 



25 2) INFORMATION FOR SEQ ID NO: 763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordellii 

(B) STRAIN: ATCC 9714 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 



GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 10 0 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 200 

45 TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 250 

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 300 

CGTACTCCAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 350 

TAGATTACAA GAGAGAATAA CATCTACAAA TAAAGGGTCT ATAACATCAG 400 



2) INFORMATION FOR SEQ ID NO: 764 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

435 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01 150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 194 02 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764 

TTAATTCAAG AATTAATCAA CAATATAGCG AAGGAACACG GTGGATTATC 50 

TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 100 

10 ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 150 

CAAATGAATG AGCCACCTGG AGCAAGAATG AGAGTTGCTC TTACAGGACT 2 00 

TACTATGGCA GAATATTTCA GGGATCAAGG ACAAAACGTA CTTTTATTCA 250 

TAGACAACAT ATTTAGATTC ACTCAAGCAG GTTCAGAGGT GTCAGCTTTA 3 00 

CTTGGAAGAA TACCTAGTGC CGTTGGATAC CAACCAACAC TAGCAACAGA 350 

15 AATGGGTGCG CTTCAAGAAA GAATAACATC TACAAAGCAT GGTTCTATAA 4 00 

CATCA 405 



20 2) INFORMATION FOR SEQ ID NO: 765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3,1 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 



AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

GGAGTTGGAG AAAGAT CTAG AGAAGGTAAT GATCTTTATC ATGAAATGAG 10 0 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 2 00 

40 GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 250 

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 300 

TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 393 



2) INFORMATION FOR SEQ ID NO: 766 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 3 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 

60 

436 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(xi) SEQUENCE DESCRIPTION: SEQ 

AGGAACTTAT AAATAATATT GCAAAACAAT 
ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT 
5 GAAAGATTCT GGGGTTATAG AGAAAACTGC 
ATGAACCACC AGGAGCGAGA ATGAGAGTAG 
GCAGAATATT TTAGAGATCA AGGGCAAGAT 
TATATTTAGA TTTACGCAGG CTGGTTCTGA 
GAATTCCAAG TGCAGTTGGA TATCAACCAA 
10 GCATTACAAG AAAGAATAAC ATCCACAAAA 



PCT/CAOO/01150 

ID NO: 766 

ATGGAGGTCT ATCTGTATTT 50 

AATGACCTAT ATAATGAAAT 100 

ACTAGTATTT GGACAGATGA 150 

CATTGACAGG ACTTACTATG 2 00 

GTACTTTTAT TTATAGATAA 2 50 

AGTTTCTGCA TTGTTAGGAA 3 00 

CCCTTGCAAC TGAAATGGGT 350 

AATGGATCAA TTACTTCAG 3 99 



2) INFORMATION FOR SEQ ID NO: 767 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
25 (A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 

3 0 CATGATGGAA CTGATC CGTA ACATCGCCAT CGAGCACAGC GGTTATTCCG 50 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 2 00 

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 2 50 

35 GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 3 00 

GGGCCGTATG CCTTCGGCAG TAGGTTACCA GCCGACCCTG GCTGAAGAGA 3 50 

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 3 90 



40 



50 



2) INFORMATION FOR SEQ ID NO: 768 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 953 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absldia corymbifera 

(B) STRAIN: ATCC 46775 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

60 GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 200 

437 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GACTGGTATC AAGGTTGTCG ATTTGCTTGC 
AGATTGGTCT TTTCGGTGGT GCCGGTGTCG 
GAATTGATCA ACAACATTGC CAAGGCTCAC 
TGGTGTCGGT GAACGTACTC GTGAAGGCAA 
5 TTTCCACTGG TGTCATCAAG CTTGAAGGTG 
TTTGGTCAAA TGAACGAACC CCCCGGAGCT 
TGGTTTGACC ATTGCCGAAT ACTTGCGTGA 
TGCTCTTCAT TGACAACATT TTCCGTTTCA 
TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT 

10 YTCCACTGAT ATGGGTGGTA TGCAAGAGCG 
GTTCCATTAC CTCCGTGCAA GCTGTCTACG 
GATCCTGCTC CTGCTACTAC TTTTGCTCAC 
GTCTCGTTCC ATTGCTGAGT TGGGTATCTA 
ACTCCAAGTC CCGTATCTTG GATCCCCGTA 

15 TCTGTTGCCA CTGGTGTCCA ACAAATCCTT 
AGA 



PCT/CA00/01150 

TCCCTATGCT CGTGGtGGTA 25/J 

GCAAGACTGT CTTGATTCAA 3 00 

GGTGGTTACT CTATCTTCTG 350 

CGATTTGTAC CACGAAATGA 400 

AATCCAAGTG TGCTCTTGTC 450 

CGTGCCCGTG TTGC CTTGAC 500 

TGAGGAAGGT CAAGATGTGT 550 

CTCAAGCCGG TTCTGAAGTG 600 

GTCGGTTACC AACCCACTCT 650 

TATTACTACC ACCAAGAACG 700 

TCCCTGCTGA CGATTTGACT 750 

TTGGACGCCA CCACTGTGTT 800 

CCCTGCTGTC GATCCCCTTG 850 

TCGTTGGTGA TGAGCACTAC 900 

CAAAACTACA AGTCGCTTCA 950 

953 



20 2) INFORMATION FOR SEQ ID NO: 769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternate 

(B) STRAIN: ATCC 62099 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 



CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 50 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 100 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTAC CGGTAT 200 

40 CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 350 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 400 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 450 

45 CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA • 500 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 650 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 700 

50 GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750 

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 800 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

55 GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 1000 

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

60 CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250 

438 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TCCGATCATT CAAGGCTATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 130 0 

GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 1343 



2) INFORMATION FOR SEQ ID NO : 770 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: ATCC 26947 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTCAGGAGTT GATTGTATGT TCACCTGCAA 
ACTCTTTTCT AACTCTTCAC AGAACAACAT 
ACTCTGTCTT CACTGGTGTC GGTGAGCGTA 
TACCACGAAA TGCAGGAGAC TGGTGTCATT 
GGTCGCCCTT GTCTTCGGTC AGATGAACGA 
GTGTCGCCCT TACCGGTCTG ACCATCGCCG 
GGTCAGGATG TGCTGCTCTT CATTGACAAC 
CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG 
ACCAGCCCAC TCTGGCCGTC GACATGGGTG 
ACCACCACCA AGGGTTCCAT TACCTCCGTC 



flavus 



ID NO: 770 

CATAAGACTT CCCATTCTCC 50 

TGCCAAGGCT CACGGTGGTT 10 0 

CCCGTGAGGG TAACGATCTG 150 

CAGCTCGAGG GTGAATCTAA 2 00 

GCCCCCAGGT GCCCGTGCCC 250 

AGTACTTCCG TGACGAGGAA 300 

ATTTTCCGTT TCACCCAGGC 350 

TATCCCCTCC GCTGTCGGTT 400 

GTATGCAGGA GCGTATTACC 450 

480 



2 ) INFORMATION FOR SEQ ID NO: 771 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 3 8592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 

CTCGAACAAG AYAACTTGCC YGCCATTTTG AACGCCCTTG AAGTCAAGGA 50 

YCACTCTGGT GGACGTCTCG TYCTCGAAGT CTCTCAACAT TTGGGTGAGA 100 

ACACTGTCCG TACTATTGCT ATGGATGGTA CTGAAGGTAA GTTATGTYCA 150 

TCCCANNGGA TACAGTCARA CAGMAATGTC TAGTGGTTAT AGCAGYAGCA 200 

GATGATTGAC CAATATGTTA GGTCTTGTCC GGTGGTCAAA AGGTTGTTGA 250 

CACTGGTGCT CCCATTACCA TCCCCGTTGG TAAGGAAGT C CTTGGTCGTA 3 00 

TCATCAACGT TATTGGTGAA CCCATTGATG AACGTGGTCC CATTGACGCC 350 

AAGACTCACC GTCCTATTCA CGCTGAAGCT CCCGAATTCG TTGATCAATC 4 00 

CCCCACTCCC GAAATCCTCG AGACTGGTAT CAAGGTY GTC GATTTGTTGG 450 

CTCCTTATGC TCGTGGTGGT AAGATTGGTC TCTTCGGTGG TGCTGGTGTC 500 

439 



SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



GGTAAGACTG TCTTGATTCA AGAATTGATT AACAACATYG CCAAGGCTCA 550 

CGGTGGTTAC TCTATCTTCT GTGGTGTCGG TGAACGTACT CGTGAGGGTA 600 

ACGATTTGTA CCATGAAATG ATTGAAACCG GTGTCATTCA ATTGGAAGGC 650 

GAGTCCAAGT GTGCTCTCGT CTTTGGTCAA ATGAACGAAC CCCCAGGTGC 700 

5 TCGTGCCCGT GTCGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCCGTG 750 

ATGATGAGGG TCAAGATGTC TTGCTTTTCA TTGATAACAT TTTCAGATTC 800 

ACTCAAGCTG GTTCTGAGGT ATCTGCCCTT TTGGGTCGTA TTCCTTCCGC 850 

TGTCGGTTAC CAACCCACTC TTTCCACYGA TATGGGTGGT ATGCAAGAGC 900 

GTATTACTAC CACCAAGAAC GGTTCCATTA CCTCCGTCCA AGCTGTCTAC 950 

10 GTCCCTGCTG ATGATTTGAC CGATCCTGCT CCTGCCACCA CTTTTGCTCA 1000 

CTTGGATGCC ACCACTGTCT TGTCTCGTTC CATCGCTGAA TTGGGTATCT 1050 

AYCCCGCTGT CGATCCTCTT GATTCCAAGT CTCGTATCCT CGATCCCCGT 1100 

ATTGTCGGTG ATGAGCACTA CAAGGTTGCC ACTGAAGTTC AACAAATCCT 1150 

CCAAAACTAC AAGTCTCTCC AAGA 1174 

15 

2) INFORMATION FOR SEQ ID NO: 772 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedrala hortai 
30 (B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

35 ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 2 00 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 300 

40 TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 3 50 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 400 

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 4 50 

AAGAAGGGAT CCATTAC 467 

45 

2) INFORMATION FOR SEQ ID NO: 773 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 578 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

440 
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(xi) SEQUENCE DESCR] 

TTGCCAAGGC CCACGGTGGT 
ACCCGTGAGG GTAACGATCT 
5 TCAGCTTGAT GGCGAGTCCA 
AGCCCCCTGG TGCCCGTGCT 
GAGTACTTCC GTGATGAGGA 
TTATCTTCTT TGCCACCACC 
GGCATCGTTT GTAGGGCGGG 
10 TTGAAGCCGC GGATGCCCGA 
GACAACAGTG CTTCTCTTCA 
GTTCCGAGGT GTCTGCCCTT 
CAGCCCACTC TTGCCGTAGA 
CACCAAGAAG GGTTCGATTA 

15 



PTION: SEQ ID NO: 773 

TACTCTGTCT TCACTGGTGT 
GTACCACGAA ATGCAGGAAA 
AGGTCGCGCT TGTCTTCGGT 
CGTGTCGCTC TTACTGGTCT 
GGGTCAGGAT GGTAAGTTAT 
CCTCTACGAA TCCATGCCTC 
TCGGAGTTTG CGGCAATTTC 
TGTTTGACGC GTATCGATGC 
TTGACAACAT TTTCCGATTC 
CTCGGTCGTA TTCCCTCTGC 
CATGGGTGCC ATGCAGGAAC 
CCTCCGTC 



CGGTGAGCGT 50 

CCTCCGTCAT 100 

CAGATGAACG 150 

TACCGTTGCC 200 

ATCGTTTTTA 250 

CGTTGGTGAA 300 

TGCCGTCGGC 350 

TAACAACAAT 400 

ACCCAGGCCG 450 

CGTCGGTTAC 50 0 

GTATTACCAC 55 0 

578 



2) INFORMATION FOR SEQ ID NO: 774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
30 (B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

35 ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 2 00 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTT CTCGG 250 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 300 

40 ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400 

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATT CAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

45 AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT CTTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 800 

50 TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850 

AGAACGTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 90 0 

TCTACGTCCC TGCTGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 1000 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

55 CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTTCAACAA 1100 

ATCCTTCAAA ACTACAAGTC TCT 1123 



60 2) INFORMATION FOR SEQ ID NO: 775 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scopulariopsis koningii 

(B) STRAIN: ATCC 38745 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775 



ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 5 0 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 10 0 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 200 

20 CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 250 

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGT CGGGC AT TATTCTAACG 300 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 350 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 40 0 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATT AC CAC 450 

25 GACTCAGAAG GGCTCGATTA CCTCGGT 477 



30 



2 ) INFORMATION FOR SEQ ID NO: 776 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Trichophyton men ta.gr ophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 

45 TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 2 00 

CTACCATGAA ATGCAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 250 

50 AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 3 00 

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 350 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 400 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 450 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

55 TGCTTGGTCG TATCCCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550 

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 600 

TACCTCCGTC 610 



60 
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2) INFORMATION FOR SEQ ID NO: 777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GTGAAAATTG GACAATTGAA CAATTTAGCC CTTGGAGAAA AGAAATTTCG 100 

AGTATTAATT ATTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

20 TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 2 00 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 3 50 

TCAAGACGGT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 4 00 

25 TCATGTTCTA ACAAAGTGTA TTCTAGTGCT TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

CCATCTGCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 

GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 593 



30 
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2) INFORMATION FOR SEQ ID NO: 778 



(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 1141 bases 

<B) TYPE: Nucleic acid 
-(C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

i 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon outaneum 

(B) STRAIN: ATCC 62965 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 



CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TCTTGGTAGA ATTATCAACG TCATTGGTGA ACCAATTGAC 100 

50 GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 2 00 

TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 2 50 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 3 00 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

55 GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 400 

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

60 TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650 
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ATATGGGTGC CCTCCAAGAA 
ACATCTGTCC AAGCCGTCTA 
CCCAGCCACC ACTTTCGCTC 
CCATTTCCGA ATTGGGTATC 
5 TCTCGTCTTT TGGATCCTGA 
CACTCAAGTT CAACAAACCC 
TTGCCATTTT GGGTATGGAT 
GAACGTGCCC GTAAGATCCA 
CGAAGTTTTC ACTGGTATCG 
10 TCAGATCTTT CAAGGAAATC 



CGTATTACCA CCACCCAAAA 
TGTCCCAGCA GACGATTTGA 
ACTTGGACGC CACCACTGTC 
TACCCAGCTG TCGATCCTCT 
AGTTATTGGA CACGAACACT 
TCCAAGCTTA CAAGTCTCTC 
GAATTGTCTG AAGCTGATAA 
AAGATTCCTT TCCCAACCAT 
AAGGTCGTCT CGTTCCATTG 
CTTGAAGGTA AGTACGATCA 



gggtt£cgtc ' 




CTGATCCTGC 


750 


TTGTCTCGTT 


800 


CGATTCCAAG 


850 


ACGAAGTTGC 


900 


CAAGATAT C A 


950 


GCTTACTGTC 


1000 


TCGCTGTTGC 


1050 


AAGGAAACCG 


1100 


C 


1141 
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2) INFORMATION FOR SEQ ID NO: 779 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779 

30 GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 50 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

35 AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 3 00 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 3 50 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 4 50 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

40 CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 600 

CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 750 

45 CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA- 800 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTTCAT CAAGTCTGGT 10 00 

50 GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGCCTATGT GCGTTGAGGC 1050 

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



55 2) INFORMATION FOR SEQ ID NO: 780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 



TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 

CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 2 00 

15 GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 3 00 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 400 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 450 

20 TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 500 

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 650 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

25 AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750 

GC 752 



30 2 ) INFORMATION FOR SEQ ID NO: 781 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cujrvularia lunata 

(B) STRAIN: ATCC 26425 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 200 

50 TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 250 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 3 00 

GGTCCCCGTC GGT CGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 400 

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 450 

55 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500 

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

60 AAGTCCGGTG ACGCCGCCAT CGTCAAGA 728 
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2) INFORMATION FOR SEQ ID NO: 782 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 

20 TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 10 0 

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 150 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 250 

25 TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGC CATC A ACAAGATGGA CACCACCAAG 350 

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 4 00 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

3 0 AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 5 50 

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC 600 

TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650 

CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTCAC 700 

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 

35 ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 850 

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900 

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 1000 

40 CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 1100 

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 1145 



45 



55 



2) INFORMATION FOR SEQ ID NO: 783 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1151 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinix 

(B) STRAIN: ATCC 11347 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 
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GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA 51) 

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100 

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 150 

5 GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 300 

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 350 

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTTCATCAA 400 

10 GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 450 

TCAACGGTGA CAACATGATT GAGCCCTCCA CCAACTGCCC ATGGTACAAG 50 0 

GGATGGAACA AGGAGACCAA GGCCGGTGGC AAATCCTCTG GTAAGACCCT 550 

CCTTGAGGCC ATCGATGCCA TTGACATGCC CACTCGTCCC ACCGACAAGC 600 

CTCTCCGTCT CCCACTCCAG GATGTCTACA AGATCTCTGG TATCGGAACA 650 

15 GTACCAGTCG GTCGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTTGT 700 

CACTTTCGCC CCCGCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 750 

ACCACCAGCA GCTCGTTCAG GGTGTTCCCG GTGACAACGT TGGCTTCAAC 800 

GTCAAGAACG TCTCTGTCAA GGAAGTCCGC CGTGGTAACG TTGCCGGTGA 850 

TTCCAAGAAC GACCCACCAT CTGGCTGCGC CTCTTTCAAG GCCCAGGTCA 900 

2 0 TCGTCCTCAA CGACCCCGGC CAGATCGGTG CTGGTTACGC CCCAGTCCTC 950 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TTCTTGAGAA 1000 

GATTGACCGC CGTACTGGTA AATCCGTCGA AACCAGCCCT AAGTTCGTCA 1050 

AGTCTGGTGA TGCCGCTATT GCCACCATGG TTCCATCCAA GCCCATGTGC 1100 

GTTGAGGCTT TCACTGACTA CCCACCACTT GGTCGTTTCG CCGTCCGTGA 1150 

25 C 1151 
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2) INFORMATION FOR SEQ ID NO: 784 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

45 CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 2 00 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250 

50 CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 3 00 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 3 50 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 400 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 450 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

55 TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 550 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 700 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

60 CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800 
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TTGTGTTGCT AACGTCGCAT TTGCTTCTTT 
AAGAAGATGG AAGATGCTCC CAAAGTAAGT 
AAAATAGAAT ACTAACAATT ATTGTTTATA 
CTGCTATCGT CAAGATGGTT CCCTCCAAG 



TGCTTCTTTT GCATGTAGGT 85^ 
ATTACGATTG ATGGACAATT 900 
GTTCGTCAAG TCTGGTGACT 950 

979 



2) INFORMATION FOR SEQ ID NO: 785 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphorei verrucosa 

(B) STRAIN: ATCC 38561 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150 

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 2 00 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 250 

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 300 

GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 350 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 400 

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGC CAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC AC CACTGAAG TCAAGTCCGT CGAAATGCAC 700 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099 



2) INFORMATION FOR SEQ ID NO : 786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: SaJcsenaea vasiformis 
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(B) STRAIN: ATCC 60625 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786 



10 



15 



20 



ACCACCAAGT 
CGGYTTCATC 
CCATCTCCGG 
CCCTGGTACA 
GACCMTCCTC 
ACAAGCCCCT 
GGCACGGTTC 
GGTCGTCACC 
AGATGCACCA 
TTCAACGTCA 
CGGTGACTCC 
AGGTCATCGT 
GTCCTSGACT 
CGAGAAGATC 
TCATCAAGTC 



GGTCTGAGGC 
AAGAAGGTCG 
TTTCAACGGT 
AGGGTTGGGA 
GAGGCCATTG 
YCGTCTTCCC 
CCGTCGGTCG 
TTCCCCCCCG 
CGAGCAGCTC 
AGAACGTCTC 
AAGAACGACC 
CCTCAACCAC 
GCCACACTGC 
GACCGCCGTT 
TGGTGACGCC 



CCGTTACCAG 
GCTTCAACCC 
GACAACATGA 
GAAGGAGACC 
ACGCCATCGA 
CTMCAGGATG 
TGTYGAGACC 
CTGGTGTCAC 
GCCGAGGGTS 
CGTCAAGGAG 
CCCCCAAGGG 
CCCGGTCAGG 
TCACATTGCC 
CCGGCAAGTC 
GCCATCGTCA 



GAGATCATCA 
CAAGCACGTT 
TCGACGTCTC 
AAGGCCAAGG 
CCCCCCYAGY 
TYTACAAGAT 
GGTRCCATCA 
CACTGAGGTC 
TCCCCGGTGA 
ATCCGTCGTG 
CTGCGACAGC 
TCGGCGCCGG 
TGCAAGTTCT 
CATCGAGTCC 
AGATGGTTCC 



AGGAGACCTC 50 

CCCTTCGTGC 100 

CACCAACTGC 150 

CCACCGGCAA 200 

CGTCCCACCG 2 50 

TGGCGGTATT 3 00 

AGGGTGGCAT 350 

AAGTCCGTCG 4 00 

CAACGTCGGC 450 

GCAACGTTGC 500 

TTCAACGCCC 550 

CTACGCGCCS 60 0 

CTGAGCTCCT 650 

GGCCCCAAGT 700 

CTCCAAGCCC 750 



2) INFORMATION FOR SEQ ID NO: 787 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sync epha la strum racemosum 

(B) STRAIN: ATCC 3233 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 



GCTCAAGGCC 

40 AGTTCGAGAC 
CGTGATTTCA 
TATCCTCATC 
AGGATGGCCA 
CGTCAGCTGA 

4 5 GGCCCGTTAC 
TCGGTTTCAA 
GGTGACAACA 
GAAGAAGGAG 
CCATTGACAA 

50 CTTCCCCTCC 
CGGTCGTGTC 
CCCCCGCCAA 
CAGCTCGTCG 
CGTTTCCGTC 

55 ACGACCCCGC 
AACCACCCCG 
CACCGCTCAC 
GTCGTTCCGG 
GACTCCGCCA 

60 CTACACTGAG 



GAGCGTGAGC 
CCCCAAGTAC 
TCAAGAACAT 
ATTGCCGCCG 
GACCCGTGAG 
TCGTCGCCAT 
AACGAAATCG 
CCCCAAGTCC 
TGTTGGAGGA 
ACCAAGGCTG 
CATTGACCCC 
AGGATGTCTA 
GAGACTGGTG 
CGTCACCACT 
AGGGTGTCCC 
AAGGATATCC 
CAAGGAGTCT 
GTCAGATCGG 
ATTGCCTGCA 
YAAGAAGCTC 
TCGTCAAGAT 
TACCCCCCTC 



GTGGTATCAC 
CACGTCACCG 
GATCACTGGT 
GTACTGGTGA 
CACGCTCTGC 
CAACAAGATG 
TCAAGGAGGT 
GTTCCCTTCG 
GTCCTCCAAC 
GCGAGAAGTC 
CCGGTCCGTC 
CAAGATCGGT 
TCATCAAGGC 
GAAGTCAAGT 
CGGTGACAAC 
GCCGTGGTAA 
GCCTCGTTCA 
TGCCGGTTAC 
AGTTCGCTGA 
GAAGAGTCCC 
GGTTCCCTCC 
TTGGCCGTTT 



CATCGATATC 
TCATTGATGC 
ACCTCGCAGG 
GTTCGAGGCT 
TTGCCTTCAC 
GACTCGACCA 
CTCCACCTTC 
TCCCCATCTC 
ATGCCCTGGT 
CGGCAAGACC 
CCTCGGACAA 
GGTATCGGCA 
TGGTATGGTC 
CCGTCGAGAT 
GTCGGTTTCA 
CGTCTGCTCT 
CCGCCCAGGT 
GCCCCGGTTC 
GCTCCTCGAG 
CCAAGTTCGT 
AAGCCCATGT 
CGCC 



GCTCTCTGGA 50 

CCCCGGCCAT 100 

CTGACTGCGG 150 

GGTATCTCCA 200 

CCTCGGTGTC 2 50 

AGTACTCTGA 300 

ATCAAGAAGA 350 

TGGCTGGAAC 400 

TCAAGGGCTG 450 

CTCCTTGAGG 500 

GCCCCTCCGT 550 

CAGTCCCCGT 600 

GTGACCTTCG 650 

GCACCACGAG 700 

ACGTCAAGAA 750 

GACTCCAAGA 800 

CATCGTCCTG 850 

TTGACTGCCA 900 

AAGATCGACC 950 

CAAGTCGGGT 1000 

GCGTTGAGGC 1050 
1084 
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2) INFORMATION FOR SEQ ID NO: 7 88 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788 

2 0 GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC AATGTCACCG TCATTGGTAT GTTTTTCTTT 100 

ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 150 

CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 

CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 250 

25 GGTATCTCCA AGGATGGCCA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 300 

CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 

ACTGGTCCGA GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 400 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 450 

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

30 ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 550 

ACCCTYCTCG AGGCCATCGA TGCCATCGAC ATGCCAACCC GTCCTACCGA 600 

CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 650 

GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG 700 

GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 750 

35 AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 8 00 

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 850 

GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 900 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 950 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 1000 

40 GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA ACCCCAAGTT 1050 

CGTCAAGTCT GGTGATGCCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 1100 

TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

CGTGA 1155 



45 



2) INFORMATION FOR SEQ ID NO: 789 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 113 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 9 



TCAAGGCCGA 
TTCGAGACCC 
5 CTTTCCCCTC 
CCGGTCACCG 
GACTGTGCTA 
TATCTCCAAG 
TTGGTGTCAA 

10 TGGTCCGAGG 
CAAGAAGGTT 
GTTTCAACGG 
AAGGGATGGA 
CCTCCTCGAG 

15 AGCCCCTCCG 
ACTGTACCAG 
CGTCACCTTC 
TGCACCACCA 
AACGTCAAGA 

20 TGACTCCAAG 
TCATCGTCCT 
CTCGACTGCC 
GAAGATTGAC 
TCAAGTCTGG 

25 TGCGTTGAGG 



GCGTGAGCGT 
CCAAGTACAA 
CATCGTCTTG 
TGACTTCATC 
TTCTCATCAT 
GATGGCCAGA 
GCAGCTCATC 
ACCGTTTCAA 
GGCTACGACC 
TGACAACATG 
ACAAGGAGAC 
GCCATCGATG 
TCTCCCACTC 
TCGGTCGTGT 
GCCCCTGCCA 
GCAGCTTCAG 
ACGTTTCCGT 
AACGACCCAC 
CAACCACCCC 
ACACTGCTCA 
CGCCGTACCG 
TGATGCCGCT 
CTTTCACTGA 



GGTATCACCA 
TGTCACCGTC 
CTGTGCCATA 
AAGAACATGA 
TGCTGCCGGT 
CCCGTGAGCA 
GTTGCCATCA 
GGAAATCATC 
CCAAGGGTGT 
ATTGAGGCCT 
CAAGGCCGGT 
CCATCGACAT 
CAGGATGTCT 
TGAGACCGGT 
ACGTCACCAC 
CAGGGTGTCC 
CAAGGAAGTC 
CATCCGGCTG 
GGCCAGATCG 
CATTGCTTGC 
GTAAATCCGT 
ATCGCCAAGA 
CTACCCCCCA 



TCGATATCGC 
ATTGGTATGT 
ACTAACGAGA 
TCACTGGTAC 
ACTGGTGAGT 
CGCTCTGCTC 
ACAAGATGGA 
AAGGAAGTCA 
TCCATTCGTT 
CCACCAACTG 
GGTGCCAAGA 
GCCAACCCGT 
ACAAGATCTC 
ATCATCAAGC 
TGAAGTCAAG 
CCGGTGACAA 
CGCCGTGGTA 
TGCCTCCTTC 
GTGCTGGTTA 
AAGTTCGCTG 
CGAAGCCAAC 
TGGTTCCATC 
CTTGGTCG 



CCTCTGGAAG 50 

TTCTCTTTAC 100 

GTAGACGCCC 150 

CTCCCAGGCT 2 00 

TCGAGGCTGG 250 

GCCTTCACCC 3.00 

CACCACCAAC 350 

CCAACTTCAT 400 

CCAATCTCTG 450 

CCCATGGTAC 5 00 

CTGGCAAGAC 550 

CCTACCGACA 6 00 

TGGTATCGGA 650 

CCGGTATGGT 700 

TCCGTCGAAA 750 

CGTCGGCTTC 800 

ACGTTGCCGG 850 

AACGCCCAGG 900 

CGCCCCAGTC 950 

AGCTCCTCGA 1000 

CCCAAGTTCG 1050 

CAAGCCTATG 1100 
1138 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 790 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolar is hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 



45 CACCACCAAG 
CCAACTTCAT 
CCCATCTCTG 
CCCCTGGTAC 
AGACCCTTCT 

50 GACAAGCCCC 
TGGCACGGTT 
TGGTCGTCAC 
GAGATGCACC 
CTTCAACGTC 

55 CCGGTGACTC 
CAGGTCATCG 
AGTCCTTGAC 
TCGAGAAGAT 
TTCATCAAGT 

60 



TGGTCTGAGG 
CAAGAAGGTC 
GTTTCAACGG 
AAGGGTTGGG 
CGAGGCCATC 
TCCGCCTTCC 
CCCGTCGGTC 
CTTCGCCCCC 
ACGAGCAGCT 
AAGAACGTCT 
CAAGAACGAC 
TCCTCAACCA 
TGCCACACTG 
TGACCGCCGT 
CCGGTGACGC 



AGCGTTACCA 
GGCTACAACC 
AGACAACATG 
AGAAGGAGAC 
GACGCCATCG 
CCTCCAGGAT 
GTGTCGAGAC 
GCTGGTGTCA 
TGCCGAGGGT 
CCGTCAAGGA 
CCCCCCAAGG 
CCCCGGTCAG 
CCCACATTGC 
ACCGGAAAGT 
CGCCATCGTC 



GGAAATCATC 
CCAAGCACGT 
ATTGAGGCTT 
CAAGGCCAAG 
ACCCCCCTGT 
GTGTACAAGA 
CGGTATCATC 
CCACTGAAGT 
GTCCCCGGTG 
GATCCGTCGT 
GTTGCGAGTC 
GTCGGTGCCG 
TTGCAAGTTC 
CTGTTGAGAA 
AAGATGGTTC 



AAGGAGACCT 50 

TCCCTTCGTC 100 

CCACCAACTG 150 

GCCACTGGTA 2 00 

CCGTCCTACC 250 

TTGGTGGTAT 3 00 

AAGCC CGGTA 3 50 

CAAGTCCGTC 400 

ACAACGTCGG 450 

GGTAACGTTG 500 

CTTCAACGCC 550 

GTTACGCACC 600 

TCCGAGCTCC 650 

CTCCCCCAAG 700 

CCTCCAAG 748 
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2) INFORMATION FOR SEQ ID NO: 791 



5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
15 (B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 



CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 50 

20 TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100 

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG TCCAGAAGAT 150 

CGTTGT CTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 200 

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 250 

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 300 

25 ACGCCGTGAC GACATCGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 3 50 

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 40 0 

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 450 

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 500 

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550 

3 0 TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT 600 

CCTGCGTGGT ATCCGACGTG AAGACGTCAA GCGTGGTATG GTCATTGCTG 650 

TTCCCGGCAG CACCAAAGCT CACGACAAGT TCCTCGTCTC CATGTACGTC 700 

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 750 

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC AC CGTGTCCA 800 

35 GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850 

GCTGACCTCA GCTTCCCTGA CGGCGACCAA TCTCGCAGAG TTATGCCTGG 900 

TGACAACGTC GAGATGATCC TGAAGACCCA CCACCCTGTT GCTGCTGAGG 950 

CTGGTCAA 958 

40 



2) INFORMATION FOR SEQ ID NO: 792 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 93 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 



CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 50 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

60 ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150 
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TTAACAAGGT CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTG^CGAA '2"0 0' 

CTTGAAATGC GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC 250 

CCCCATCATT TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG 3 00 

AGCTTGGTGT CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG 350 

5 ATCCCCACCC CCGAGCGCGC CACTGATAAG CCTTTCCTCA TGTGCATTGA 4 00 

GGAAGTGTTC TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG 450 

AGCGTGGTAT CCTCAAGAAG GATTCCGACG TCGAAATTGT TGGTGGCTCT 50 0 

ACCACCCCTA TCAAGACCAA GGTCACAGAT ATCGAAACCT TCAAGAAGTC 55 0 

CTGCGATGAA TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA 60 0 

10 TCAAGCGTGA GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC 650 

ACCAAGGCTC ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC 70 0 

TGAGGGTGGT CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT 750 

TCATCCGTAC TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG 80 0 

ATCATTGCTA ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC 850 

15 TGGCCTGGAG AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA 90 0 

GATGATTTGC AAGACCCTCC ACCCCATTGC TGCCGA 93 6 



20 2) INFORMATION FOR SEQ ID NO: 793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

TTATTGTTGC TGCTGGTACT 2 0 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 94 

GACGACAAGT CGGTGAACTT 2 0 



55 2) INFORMATION FOR SEQ ID NO: 795 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 

ACTTGCACGC GATGTGGCAG 



2) INFORMATION FOR SEQ ID NO: 796 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
GGTCCAATGC CWCAAACWAG A 



2) INFORMATION FOR SEQ ID NO: 797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797 
CATTAAGAAT GGYTTATCTG TSKCTCT 



2) INFORMATION FOR SEQ ID NO: 798 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 
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22 



5 2 ) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 

2 0 ACCTGTGAAT ACAAGCAATC T 21 



25 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 8 00 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 

GATGAAATCT TCAACGAAGT TGAT 24 

40 

2) INFORMATION FOR SEQ ID NO: 801 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE -DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 2 0 
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2) INFORMATION FOR SEQ ID NO: 802 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 02 

TTGCCATTTC TGGTTTCGTT 



2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 

ACTTCAGTGG TAACACCAGC 



2) INFORMATION FOR SEQ ID NO: 804 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 

CCTGGGACGG CCTCTGGCAT 



2) INFORMATION FOR SEQ ID NO: 805 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 

CTCTTGTCCA TCTTAGCAGT 20 



2 ) INFORMATION FOR SEQ ID NO: 806 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806 

AGCATCACCA GACTTGATAA G 21 



2 ) INFORMATION FOR SEQ ID NO: 807 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 

AAAGTGGCTT CAAAGGTTGC 20 



55 2 ) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808 

GCITTAIWRG CATTAGAARA YCCA 24 



10 



20 



25' 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 809 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TCTTCCTGTW GCAACTGTTC CTCT 24 

2) INFORMATION FOR SEQ ID NO: 810 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 

AGAGMWACAG ATAARSCATT CTTA 24 



2) INFORMATION FOR SEQ ID NO: 811 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 25 

2) INFORMATION FOR SEQ ID NO: 812 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 26 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 26 



2) INFORMATION FOR SEQ ID NO: 813 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
25 AATCYGTYGA AATGCAYCAC GA 22 



2 ) INFORMATION FOR SEQ ID NO: 814 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 

40 

GCI GGCACGT ACACIGCCTG 20 

45 2) INFORMATION FOR SEQ ID NO : 815 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 

TGGTGCATYT CKACRGACTT 20 
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2) INFORMATION FOR SEQ ID NO: 816 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 
GCTACGACGA GATCAAGGGC 20 



2) INFORMATION FOR SEQ ID NO: 817 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 20 

30 

2) INFORMATION FOR SEQ ID NO: 818 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
45 AGCCGGGCTG GATCTTCTTC 20 



2 ) INFORMATION FOR SEQ ID NO: 819 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 

60 
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2) INFORMATION FOR SEQ ID NO: 82 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 

GAAGGAGGTG TCTGCTTACA C 



2) INFORMATION FOR SEQ ID NO: 821 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 



2) INFORMATION FOR SEQ ID NO: 822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 

CGGCGGATGT CCTTAACAGA A 
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2 ) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCGGTATG AYGAGATTGT 



2) INFORMATION FOR SEQ ID NO: 824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
GGCTTCTGCG GCACCATGCG 



2 ) INFORMATION FOR SEQ ID NO: 825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 
ATGAGCARCG SAACCATCGT TCAGTG 



2) INFORMATION FOR SEQ ID NO: 826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 6 
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TCGATCGTGC CGAC CATGTA GAACGC 2 6 



2) INFORMATION FOR SEQ ID NO: 827 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 446 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

CACCAACTTG CTAAATGGGG AGATGC CCAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTGCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

25 GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 200 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 250 

TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 300 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 350 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 400 

30 GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



2) INFORMATION FOR SEQ ID NO: 828 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

50 CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

55 CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 300 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 350 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 400 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 

60 
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2) INFORMATION FOR SEQ ID NO: 829 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829 



ATGCTATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 50 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

20 TATTGACCCA AAAACTGGGG AAAGTTTAAT GAAGAGAACA GTACTTATAG 150 

CTAATACTTC AAACATGCCA GTTGCTGCTA GAGAAGCTTG CATATACACA 200 

GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 250 

AATGGCTGAT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 3 00 

GTAGACTTGA AGAAATGCCA GGTGATGAAG GATATCCAGC GTACTTAGGA 350 

25 TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 400 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 445 



3 0 2) INFORMATION FOR SEQ ID NO : 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 



45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 



TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 150 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 200 

50 GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 250 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 300 

TAGACTTGAA GAAATGCCTG GTGATGAAGG ATATCCAGCT TACCTTGGAT 350 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 444 



2) INFORMATION FOR SEQ ID NO: 831 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 456 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN : ATCC 13124 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831 

TAGTTCAGCA CCAAGTTGCT AAATGGGGAG ATACTGAGAT AGTTGTTTAC 50 

GTTGGATGTG GAGAACGTGG TAACGAGATG ACAGACGTTC TTAACGAATT 100 

15 CCCAGAACTT AAAGACCCTA AAACTGGGGA AAGCTTAATG AAGAGAACAG 150 

TTCTTATTGC TAATACATCT AACATGCCAG TTGCTGCCAG AGAAGCATCA 2 00 

ATATATACTG GTATAACAAT AGCAGAGTAT TTCAGAGATA TGGGATACTC 250 

AGTATCAATC ATGGCTGACT CAACTTCACG TTGGGCAGAG GCTTTAAGAG 300 

AAATGTCAGG AAGACTTGAA GAAATGCCAG GAGACGAAGG TTACCCAGCA 350 

2 0 TACTTAGGAT CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAAGTTGT 4 00 

AGCTTTAGGT AAAGATGGAA GAGAAGGAGC TGTTACAGCT ATCGGAGCAG 4 50 

TATCCC 4 5 S 



25 2 ) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 



CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT ATATATACTG 200 

45 GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 250 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 300 

AAGACTAGAG GAGATGCCAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 3 50 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 40 0 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 

50 

2) INFORMATION FOR SEQ ID NO: 833 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA " 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: Genome project 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 



PCT/CA00/01150 



TGAACCAAGG 

10 GGGATGCAAG 
CTTAGTCGGA 
TTTATGAGGA 
GGTTGTCCTT 
TGACGGTATT 

15 ACTTTTTAAT 
TGGGCATTTA 
TATTTTAGGA 
TTCCTTATAA 
ACAGTAACAG 

2 0 CCAAGGTAGC 
CTCAAAAGCT 
GACACCTTTT 
ATTTGGGGCA 
ACGTTGATAT 

25 ACCGACGTTT 
GTCCATTATG 
TAGCAGCGCG 
TTCCGTGATA 
TTGGGCAGAA 

30 GTGATGAAGG 
GAACGGGCTG 
CATTACAGCC 
CTGTCACTCA 
GCGCCTCTTG 

35 TTATTCATTG 
AATCTAATTG 
GAAGCCAGTC 
TGAACAAGAT 
ATCTCCAGCA 

4 0 AAACAAGAGG 
CAAAGCCCTT 
CTCAGGTACG 
TTAGAGCAGA 
CGTTTTAGCA 

45 



AAAAATAATA 
AAGCTAATAT 
GAAATTATTG 
AACATCAGGG 
TGTCGGTCGA 
CAGCGACCGC 
CCGTGGTGTG 
TTCCCAAGCT 
ACTGTGCAAG 
AGTTTCAGGG 
ATACAGTTTA 
CTCATGCAGA 
TATCCCAGTC 
TCCCTGTTAC 
GGAAAAACAG 
TGTTATTTAT 
TGAATGAGTT 
GAGCGCACGG 
TGAAGCGTCG 
TGGGCTATTC 
GCTCTGCGCG 
CTACCCGGCT 
GTCGTGTTCG 
ATCGGCGCGG 
AAACACCCTT 
CGCAACGGCG 
TATCAAGATG 
GTCCAACAAG 
TAGAAGAAAT 
CGTTTGACCA 
AAATGCCTTT 
CCATGCTAAC 
TCTTTGGGAG 
CGATCGCATC 
TTAAAGGGCT 
AAGGGAGGAA 



ACCGTTTCGG 
TCAAGATATT 
AAATGCGTCG 
ATCGGTCCAG 
GTTAGGCCCG 
TTGATCGTTT 
• GCTATCCCAA 
AAGTGTTGGT 
AAACAGCTGT 
ACCTTGGTGG 
TGAAATTAAG 
CTTGGCCAGT 
GAACCTTTGG 
AAAAGGTGGT 
TTGTGCAGCA 
GTCGGTTGTG 
TCCAGAGTTA 
TGTTAATTGC 
ATTTACACAG 
TGTGGCTATC 
AGATGTCAGG 
TACTTAGGGA 
GACCTTGGGA 
TTTCTCCTCC 
CGGATTGTCA 
TCACTTCCCA 
ATGTAGGAAG 
GTAACTCGTG 
TGTACGCTTG 
TGGCTGTTGC 
GATTCGGTGG 
CAATATTTTG 
CTTATTTTAA 
GCACGCAGCA 
TACTCAGAAG 
TTTAGATGAG 



GACCTCTTGT 
TGTCGTGTGG 
CGATCAAGCG 
GAGAACCAGT 
GGCCTGATTT 
TCAAAAAGCA 
GTCTTGATCG 
CAAGAAGTAG 
CATTGAGCAC 
CTATTCATGC 
CAGGAAGACG 
TCGTCAAAGT 
TTACAGGTCA 
GCCGCTGCCG 
TCAAATAGCT 
GGGAACGCGG 
ATTGACCCAA 
AAACACCTCT 
GTATTACCAT 
ATGGCAGACT 
ACGCCTACAA 
GTCGTATTGC 
AGTCAAGAAC 
TGGAGGGGAT 
AAGTTTTCTG 
GCGATTAACT 
CTATATTGAC 
CCATGGCTAT 
GTGGGGCTTG 
CCGGCAAATT 
ATACCTTTAC 
ACCTTTAATG 
TGAGATTATG 
AATTTATCCC 
GTTACCAAAG 
CGTTCT 



TGTGGCTTCT 50 

GACATCTTGG 100 

TCTATTCAGG 150 

AGTGACTACT 200 

CAGAAATGTT 2 50 

ACGGACAGCG 3 00 

AAAGGCTAAG 350 

TTGCAGGTGA 4 00 

CGTATCATGG 450 

AGGGGACTTC 500 

GTTCCATTTA 550 

CGCCCTGTTG 600 

ACGGGTTATT 650 

TTCCTGGACC 7 00 

AAATTTGCCA 750 

CAACGAGATG 800 
ATACAGGCCA ' 850 

AATATGCCAG 900 

TGCCGAATAT 950 

CGACATCACG 1000 

GAAATGCCTG 1050 

CGAATATTAT 1100 

GTGAGGGAAC 1150 

ATTTCAGAGC 12 00 

GGGGCTCGAC 12 50 

GGCTGACGTC 13 00 

CGTAAACAGC 1350 

TTTGCAGCGT 14 00 

ATTCACTGTC 1450 

CGGGAGGATT 1500 

TTCCTTTCCG 1550 

AGGAAGCCAG 1600 

GAAGGCACTG 1650 

AGAAGAAAAC 17 00 

AGATTCACCA 1750 
1786 



2) INFORMATION FOR SEQ ID NO: 834 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 
60 (B) STRAIN: Suarez-3 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 

TATCTCACGT AAGTTTTTGC GCGGCGGTTA TATATCACTC CAGGCCCTGG 50 

CTAAGTACGC TAATACTGAC GTTACTGTCT ATGTGGGATG TGGAGAGCGT 100 

GGAAACGAGA TTGCGGAGGT GCTTAAGGAG TTCCCTGAGC TGAAGACCAA 150 

GGTTGATGGC AAGGAAGTGA GCATTATGAA ACGCACTTGC TTGGTGGCCA 2 00 

ATACTTCAAA CATGCCAGTG GCCGCCAGGG AGGCTAGTAT CTACACTGGC 250 

ATTACCCTAT GTGAATACTT CAGGGATATG GGATACAACG CCTGTGTGAT 3 00 

GGCGGATTCC ACCAGTCGTT GGGCTGAGGC TTTGCGTGAG ATATCAGGTC 3 50 

GTTTAGCTGA GATGCCTGCT GATTCAGGTT ATCCCGCCTA CCTTGCTTCT 400 

AGGCTTTCGG CGTTCTATGA GCGTGCTGGT ACAGCTGAGT GTATTGGAAC 450 

ACCACTTCGT GAAGGTTCAG TTACCATTGT TGGTGCTGTA TCTCCACCA 4 99 



2) INFORMATION FOR SEQ ID NO: 835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 83 5 

TTTCTCAAGC TTTGAGTAAA TATAGTAACT CTGATGTTAT TATTTACATT 50 

GGTTGTGGAG AAAGAGGAAA TGAAATGGCA GAAGTTCTTA CAGAATTCCC 100 

TGAGCTTTAT ACTATGGTTG ATGGAAAGAA GGAGTCAATT ATGCAAAGAA 150 

CTTGTTTAGT AGCTAATACA TCAAATATGC CTGTCGCTGC TAGAGAAGCT 2 00 

TCCATCTACA CTGGTATTAC ACTTTCTGAA TACTTTAGAG ATATGGGATG 250 

TAATGTTTCT ATGATGGCAG ATTCAACTTC TCGTTGGGCT GAAGCTCTTA 3 00 

GAGAAATTTC TGGTAGATTA GCTGAAATGC CTGCAGATTC GGGTTACCCA 3 50 

GCATATTTAG GCGCCAGACT TGCTTCATTC TATGAAAGAT CAGGAAGAGT 4 00 

TAAATGTATG GGTTCCCCAG ATAGAGAAGG TACAGTAACA ATTGTTGGTG 450 

CAGTTTCTCC ACCT 4 64 



2) INFORMATION FOR SEQ ID NO: 836 
3) 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 

467 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCCCTCTCCA AGTACTCCAA CTCCGATTGC 
CGAGCGCGGT AATGAGATGG CCGAGGTGCT 
CGACCGTGAT CGATGGCCGC GAGGAGTCGA 
GTGGCGAACA CCTCGAACAT GCCAGTCGCA 
5 CACCGGCATC ACCCTGGCCG AGTACTACCG 
CCATGATGGC TGACTCGACG TCTCGCTGGG 
TCGGGTCGTC TGGCGGAGAT GCCGGCGGAT 
CAGCGCTCGT CTCGCCTCCT TCTACGAGCG 
TCGGCGGGCC GAAGCGCCAG GGCTCCGTCA 



PCT/CA00/01150 

GTCATCTATG TC&GCTGCGG ' 50 

CATGGAGTTC CCGACCCTGA 100 

TCATGAAGCG CACCTGCCTC 150 

GCCCGTGAGG CCTCTATTTA 200 

TGATATGGGC AAGCATATCG 250 

CCGAGGCGCT TCGTGAGATT 3 00 

GGTGGCTACC CCGCCTACQT 350 

CGCCGGCCTC GTCACCTGCA 4 00 

CGATCGTCGG TGCCGT " 446 



2 ) INFORMATION FOR SEQ ID NO: 837 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



40 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 
25 (B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837 

TTAGTCAGGC CCTCTCCAAG TACTCCAACT CCGACTGCGT CATCTATGTC 50 

3 0 GGCTGCGGCG AGCGCGGTAA TGAGATGGCT GAGGTGCTCA TGGATTTCCC 100 

AACTCTGACG ACCGTGATCG ATGGTCGCGA GGAGTCCATC ATGAAGCGCA 150 

CCTGCCTCGT GGCAAACACT TCGAACATGC CAGTCGCAGC CCGCGAGGCC 200 

TCTATTTACA CCGGCATCAC CCTGGCCGAG TACTACCGTG ATATGGGCAA 250 

GCATATTGCC ATGATGGCCG ACTCGACATC TCGCTGGGCC GAGGCGCTTC 3 00 

35 GTGAGATTTC CGGTCGTCTG GCGGAGATGC CAGCCGATGG TGGCTACCCT 350 

GCCTACCTCA GCGCTCGTCT CGCCTCCTTC TACGAGCGCG CCGGCCTCGT 400 

CACCTGCATC GGCGGGCCGA AGCGCCAGGG CTCCGTCACG ATCGTCGGTG 450 

CTGTGT 456 



2 ) INFORMATION FOR SEQ ID NO: 838 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 450 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Leishmania tarentolae 

(B) STRAIN: MOU-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838 



AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 50 

GGCGAGCGCG GTAATGAGAT GGCCGAGGTG CTCATGGAGT TCCCGACCCT 100 

60 GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 150 

468 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TCGTGGCGAA CACCTCCAAC 
TACACTGGTA TCACCCTGGC 
CGCCATGATG GCTGACTCGA 
TTTCGGGTCG CCTGGCGGAG 
5 CTTAGTGCTC GTCTTGCCTC 
CATCGGTGGG CCGAAGCGCC 



ATGCCAGTCG CAGCCCGTGA 
CGAATACTAC CGTGATATGG 
CGTCTCGCTG GGCGGAGGCG 
ATGCCGGCTG ATGGTGGGTA 
CTTCTACGAG CGTGCCGGCC 
AGGGCTCCGT CACGATCGTC 



A&CCTCTATT ~ 200 

GCAAGCATAT 250 

CTTCGTGAGA 300 

CCCCGCCTAC 350 

TCGTCACCTG 4 00 

GGTGCCGTGT 450 



10 2) INFORMATION FOR SEQ ID NO: 839 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 437 bases 

(B) TYPE: Nucleic acid 
15. (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 7 95 



25 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839 



GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 50 

TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 100 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 150 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 200 

3 0 CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 2 50 

CCATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 3 00 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 3 50 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG TGCTGGCCGT GTGACATGCA 4 00 

TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT 437 

35 

2) INFORMATION FOR SEQ ID NO: 840 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 
50 (B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 0 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

55 TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

60. CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

469 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC 
GCAGGGCGAC AACATGATTG ACAAGTCGGA 
GCCCCACGCT GCTGGAGGCA CTCGACATGC 
AGCGACAAGC CGCTGCGCCT GCCGCTGCAG 
5 TATCGGCACC GTGCCGGTCG GTCGCGTGGA 
GCGACGTGGT GACGTTTGCG CCCGCCAACG 
ATTGAGATGC ACCACGAGCA GCTGGCCGAG 
CGGCTTCAAC GTGAAGAACG TGTCCGTGAA 
TGTGCGGCAA CTCGAAGAAC GACCCCCCAA 

10 GCGCAGGTGA TCATCCTGAA CCACCCCGGC 
GCCGGTGCTC GACTGCCACA CCTGCCACAT 
TCGAGTCCAA GATCGACCGC CGCTCCGGCA 
AAGTCGATCA AGTCCGGTGA CGCCGCCATG 
GCCCATGTGC GTGGAGGTGT TCAACGACTA 

15 CC 



PCT/CA00/01150 

TTCATCCCCA TCTCCGGCTG AOQ 

AAATATGCCG TGGTACAAGG 450 

TGGAGCCCCC GGTGCGCCCC 500 

GACGTGTACA AGATCGGCGG 550 

GACGGGCACG ATGAAGCCCG 600 

TGACGACGGA GGTGAAGTCG 650 

GCCACGCCCG GCGACAACGT 700 

GGACATCCGC CGTGGCAACG 750 

AGGAGGCGGC CGACTTCACG 800 

CAGATCGGCA ACGGCTATGC 850 

CGCGTGCAAG TTCGCCGAGA 900 

AGGAGCTTGA GAAGAACCCC 950 

GTGCGCATGG TGCCGCAGAA 1000 

CGCTCCTCTT GGCCGCTTTG 1050 

1052 



2) INFORMATION FOR SEQ ID NO : 841 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

35 TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

40 GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

45 AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

50 TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 8 00 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

55 GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CCGTGCGTGA C 1061 



60 2) INFORMATION FOR SEQ ID NO: 842 

470 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 



GCTGAAGGCT GAGCGCGAGC GCGGCATCAC GATCGACATC GCGCTCTGGA 50 

AGTTCGAGTC GCCCAAGTCT GTGTTCACGA TCATCGACGC CCCCGGCCAC 10 0 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCTCAGG CGGACGCCGC 150 

CGTCCTTGTC ATTGCGTCAT CGCAGGGTGA GTTTGAGGCG GGCATCTCGA 200 

2 0 AGGACGGCCA GACACGCGAG CACGCGCTGC TCGCCTTCAC GCTCGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGT CGGTGAACTT 300 

CGCCCAGGAG CGCTACGATG AGATTGTGAA GGAGGTGTCG GCGTACCTGA 350 

AGAAGGTTGG GTACAACGTG GAGAAGGTGC GCTTCATCCC CATCTCCGGC 4 00 

TGGCAGGGCG ACAACATGAT TGACAAGTCG GAAAATATGC CGTGGTACAA 450 

25 GGGCCCCACG CTGCTGGAGG CACTCGACAT GCTGGAGCCC CCGGTGCGCC 500 

CCAGCGACAA GCCGCTGCGC CTGCCGCTGC AGGACGTGTA CAAGATCGGC 550 

GGTATCGGCA CCGTGCCGGT CGGTCGCGTG GAGACGGGCA CGATGAAGCC 600 

CGGCGACGTG GTGACGTTTG CGCCCGCCAA CGTGACGACG GAGGTGAAGT 65 0 

CGATTGAGAT GCACCACGAG CAGCTGGCCG AGGCCACGCC CGGCGACAAC 70 0 

30 GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG AAGGACATCC GCCGTGGCAA 750 

CGTGTGCGGC AACTCGAAGA ACGACCCCCC AAAGGAGGCG GCCGACTTCA 800 

CGGCGCAGGT GATCATCCTG AACCACCCCG GCCAGATCGG CAACGGCTAT 850 

GCGCCGGTGC TCGACTGCCA CACCTGCCAC ATCGCGTGCA AGTTCGCCGA 90 0 

GATCGAGTCC AAGATCGACC GCCGCTCCGG CAAGGAGCTT GAGAAGAACC 950 

3 5 CCAAGTCGAT CAAGTCCGGT GACGCCGCCA TGGTGCGCAT GGTGCCGCAG 1000 

AAGCCCATGT GCGTGGAGGT GTTCAACGAC TACGCTCCTC TTGGCCGCTT 1050 

TGCCGTGCGT GA 1062 



40 



50 



2) INFORMATION FOR SEQ ID NO: 843 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1057 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 50 

AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 100 

GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 150 

60 TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 200 

471 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCT^GTGTT 2^0 
GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 3 00 

TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 350 
ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTTCGTGA GTTATTGAGT 400 
5 GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 450 
AAAGGCATTG AACAGTAGTT CCGAGGCTGA CATTAAGCCA ATTCAGGATT 500 
TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 550 
ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGCCATTC CTGGCAAGGG 600 
TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 650 

10 CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 700 
GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 750 
GATTGGTGTT TTGCTCAAGG GTGTGAAGCG CGACRAGGTA GAGCGCGGGT 800 
TTGTATTGAT TCAACCCGGA AGTTACAAAT GTCACGGTGA ATTTGATGCT 850 
GACTTGTACG TGTTGACTAC GRAGGAAGGT GGGCGCAAGC ATCCGTTTGT 900 

15 GTCTAACTAC CGTCCTCAGG CGTTTATACG TACTGGAGAC GTTTGCTGCT 950 

CAGTTCATTT GGATRAGGGT GTTGAGATGG CAGCTCCTGG TGACAACGTG 10 0 0 

CGTTGCAAGA TCAAGTTACT TTATCCCATG CCTGTCCATG AAGGTTTACG 1050 

ATTTGCG 1057 



20 



35 



2) INFORMATION FOR SEQ ID NO: 844 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lelshmania aethiopica 

(B) STRAIN: ATCC 50119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

40 TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 3 50 

45 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

50 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACCGG CTTCAGCCCC 7 00 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

55 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



60 2) INFORMATION FOR SEQ ID NO: 845 
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10 



15 



20 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845 



CGGTGGCATC ATTGTGGTGG 
GCGAGCATCT CTTGATCTGC 
TTCATCAACA AAGTGGACAT 
CATGGAGGTG CGTGAGCAGC 
CGCCCATCGT GCGCGGCTCG 
TACGAGGAGA ACATCCTCGA 
CGACCCGCCG CGCAACACAG 
TTTACGAGAT CGGCAAGGAC 
GATCAGGGCG TGCTGAAGCT 
CGC CAAGAAG TCGACGGTGA 
CGCTGAGTGA GTGCATGCCC 
ACCGGCGACA CGATCAGTCT 
AATGGCGGCA ACGGGTAGCA 
TGTACGTGCT GACGAAGGAT 
CACTACCGCC CGCAGCTCTT 
GAGCTTCCCG GAGGCGGAGA 
GCCGCGGCCC CGAGGAGGAC 
GAGAGCAAGC TCGTCTGCAT 
GCTGGCGTAC CCGATGCCCA 



CGGCCACCGA 
TCGCAGATTG 
GACGGACGAG 
TGGAGAAATA 
GCCCTCAAAG 
ACTGGTGCGG 
ACAAGCCTTT 
AAGAAGAGCG 
CAACACAGAC 
GGGTGACGGG 
GGTGACTCCG 
CTCCAAGGAC 
CGAACCTGTA 
GAGGGCGGCC 
CTTCCATTGT 
AGCACCGCGA 
AAGCAGAAGG 
GCCGGGCGAC 
TTGACAAGGG 



CGGCGTCATG 
GGCTTCCGGC 
GACACGTGCG 
CAAGTTTCCG 
CCGTCGAGGG 
AAGTGCGACG 
CCTTATGGCC 
TCATCGTGAC 
GCCGAGCTGG 
CATCGAGATG 
TCGGCGTCAG 
AACGTTGAAC 
CAACAAGGTG 
GCCACACTGG 
GCTGACGTGA 
GGAGCTCAAC 
AGGCGGAGAT 
AACCGCGAGC 
TCTGAAGTT 



CCGCAGACAC 50 

GCTCGTAGGG 100 

ACCTGGTGGA 150 

GCGGAAGAGA 2 00 

CGACGCGAAG 250 

AATGGATCCC 3 00 

ATTGAGCACG 350 

CGGCCGCGTC 400 

CCGGCTTCAG 450 

TACCACAAGA 50 0 

CATTGTCGGC 550 

GCGGCATGGT 600 

AAGGCGCAGG 650 

CTTCAGCCCC 700 

CGGCGGACAT 750 

AAGAAATTCG 800 

GAAAGAGTTC 850 

TGATCCTGAC 900 

939 



2 ) INFORMATION FOR SEQ ID NO: 846 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 
50 (B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

55 TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

AAGGTGGACA TGACGGACGA GGACACGTGC GACCTGGTGG ACATGGAGCT 150 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 200 

TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 250 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 300 

60 GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

473 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA~ CCGGCCGCGT CGATckGGGC 4Q# 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 450 

GTCGACGGTG AGGGTGACGG GCATCGAGAT GTACCACAAG ACGCTGAGCG 500 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 550 

5 ACGACCAGTC TATCCAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 600 

GACGGGTAGC ACGAACCTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 650 

TGACGAAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 700 

CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 750 

GGAGGCGGAG AAGCACCGCG AAGAGCTCAA CAAGAAATTC GGCCGCGGCC 800 

10 CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 850 

CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 900 

CCCGATGCCC ATTGAAAAGG GTCTGAAGTT CACCATCCGT GAGGG 945 



15 



25 



2 ) INFORMATION FOR SEQ ID NO: 84 7 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 9 bases 
20 (B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doubl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 7 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

35 GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 2 00 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 3 50 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 4 00 

40 AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 6 50 

45 CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

50 GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 939 



55 



2) INFORMATION FOR SEQ ID NO: 848 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 

474 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848 

10 CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 200 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

15 GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 400 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTATC ACAAGACACT CAATGAGTGC ATGCCCGGCG 500 

20 ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

25 CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 

30 

2) INFORMATION FOR SEQ ID NO: 849 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

50 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

55 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

60 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

475 
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5 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT 
GCTGGCGTAC CCGATGCCCA TTGAAAAGGG 



PCT/CA00/01150 

GCCACACTGG CTTCA&CCCC ' ^0"(/ 

GCTGACGTGA CAGCGGACAT 750 

GGAGCTCAAC AAGAAATTCG 800 

AGGCGGAGAT GAAGGAGTTC 850 

AACCGCGAGC TGATCCTGAC 900 

TCTGAAGTTC ACC 943 



10 2) INFORMATION FOR SEQ ID NO : 850 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



25 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 



GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 2 00 

30 AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 250 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 4 00 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

35 AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

40 ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATT 918 

45 

2 ) INFORMATION FOR SEQ ID NO : 851 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
60 (B) STRAIN: ATCC 50156 

476 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 



CGGTGGCATC 
5 GCGAGCATCT 
TTCATCAACA 
CATGGAGGTG 
CGCCCATCGT 
TACGAGGAGA 

10 CGACCCGCCG 
TTTACGAGAT 
GATCAGGGCG 
CGTCAAGAAG 
CGCTGAGTGA 

15 ACCGGCGACA 
AATGGCGGCA 
TGTACGTGCT 
CACTACCGCC 
GAGCTTCCCG 

2 0 GCCGCGGCCC 
GAGAGCAAGC 
GCTGGCGTAC 



ATTGTGGTGG 
CCTGATCTGC 
AAGTGGACAT 
CGTGAGCAGC 
GCGCGGCTCG 
ACATCCTCGA 
CGCAACACAG 
CGGCAAGGAC 
TGCTGAAGCT 
TCGACGGTGA 
GTGCATGCCC 
CGATCAGTCT 
ACGGGTAGCA 
GACGAAGGAT 
CGCAGCTCTT 
GAGGCGGAGA 
CGAGGAGGAC 
TCGTCTGCAT 
CCGATGCCCA 



CGGCCACCGA 
TCGCAGATTG 
GACGGACGAG 
TGGAGAAATA 
GCCCTCAAGG 
ACTGGTGCGG 
ACAAGCCTTT 
AAGAAGAGCG 
CAACACAGAC 
GGGTGACGGG 
GGTGACTCCG 
CTCCAAGGAC 
CGAACCTGTA 
GAGGGCGGCC 
CTTCCATTGT 
AGCACCGCGA 
AAGCAGAAGG 
GCCGGGCGAC 
TTGAGAAGGG 



CGGCGTCATG 
GGCTTCCGGC 
GACACGTGCG 
CAAGTTTCCG 
CCGTCGAGGG 
AAGTGCGACG 
CCTTATGGCC 
TCATCGTGAC 
GCCGAGCTGG 
CATCGAGATG 
TCGGCGTCAG 
AACGTTGAAC 
CAACAAGGTG 
GCCACACTGG 
GCTGACGTGA 
GGAGCTCAAC 
AGGCGGAGAT 
AACCGCGAGC 
TCTGAAGTT 



CCGCAGACAC 50 

GCTCGTAGGG 100 

ACCTGGTGGA 15 0 

GCGGAAGAGA 2 00 

CGACGCGAAG 2 50 

AATGGATCCC 3 00 

ATTGAGCACG 350 

CGGCCGCGTC 4 00 

CCGGCTTCAG 450 

TACCACAAGA 50 0 

CATTGTCGGC 550 

GCGGCATGGT 600 

AAGGCGCAGG 650 

CTTCAGCCCC 700 

CGGCGGACAT 750 

AAGAAATTCG 8 00 

GAAAGAGTTC 850 

TGATCCTGAC 900 

939 



25 



2) INFORMATION FOR SEQ ID NO: 852 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 912 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial tarentolae 

(B) STRAIN: MOU-2 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

45 TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 200 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 250 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 3 00 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 3 50 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 4 00 

50 TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 450 

AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 50 0 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 650 

55 CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 8 00 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 850 

GCTGGTCTGC ATGCCAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

60 ACCCGATGCC TA 912 

477 
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2) INFORMATION FOR SEQ ID NO: 853 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trypanosoma, cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 53 

20 ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 200 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

25 ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 3 00 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 3 50 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 4 00 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 450 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 500 

30 TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

35 AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 93 6 



40 



55 



2) INFORMATION FOR SEQ ID NO: 854 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 

(B) STRAIN: PCU-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 



CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

60 ATCAATAAGT GTGACATGAT GCAAGGG C AG GAGGAAATGA TTGAGCTTGT 150 

478 
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TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGGkGG 200 

AGACGCCATT TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 300 

TCCCGACCCA CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 3 50 

5 ACGTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 400 

GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 450 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTACCATA 500 

AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 600 

10 GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 650 

AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 700 

CCTCACTATC GTCCGCAGCT TTTCTTCCAC TGTGCTGATG TCACGGCGGA 750 

TATTAACTTC CCGGAAAGCG AGAAGCTTGC AGGGGAGCTG AACAAAAAGT 800 

ATGGCCGTGA CGCGGCGGAA CAGAAGAAGA AGGAGGCAGA ACTGAAAGAG 850 

15 TTTGAAAAGA CGCTTGTCTG CATGCCTGGT GATAACCGCG AACTCCTGCT 900 

CACCCTTGCC TATCCAATGC CAATGGAAAA GGGA 934 



20 2) INFORMATION FOR SEQ ID NO: 855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 7 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 

(B) STRAIN: CGL-1 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855 



TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 2 00 

40 AGGAGACGCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 3 00 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 4 00 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 4 50 

45 GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGC CTTT 700 

50 AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 8 00 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 900 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 93 7 

55 

2) INFORMATION FOR SEQ ID NO: 856 

60 (i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia, bigemina 
10 (B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 56 

CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

15 CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 200 

CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 3 00 

2 0 AACATCTACC GTTTC AC C C A GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 4 00 

GCGCGCTGCA GGAGCGTATC AC GACGACCA ACAAGGGCTC CATCACCTCC 450 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

25 CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCAT CGCCA 700 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTT CGT CGTCGCGCGT 750 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

30 ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 888 



35 2) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 



TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

55 GAACCACCAG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 3 00 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 3 50 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

60 TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 
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ACCACTTTCT CGCACTTGGA 
GGAGTTGGGT ATTTACCCTG 
TGCTGTCGGC CAACATTGTA 
GTACAGAAAA TTTTACAGGA 
TCTGGGTATG GACGAGCTGT 
CCCGTAAGGT ACAGCGTTTC 
TTCACCGGCA AGCCTGGGAG 
TGTCAAGGAA ATCTGGAAGG 



TGCCACTACA 
CGGTCGACCC 
GGACAGGAAC 
CTACAAATCA 
CTGAGCAGGA 
CTGTCTCAGC 
GTTCGTTGAA 
TGAGTGTGAC 



GTGCTTTCTC GTTCAATTGC 
GCTTGACTCA ACGTCACGTA 
AGTACGATGC CGCACGTGGT 
CTGCAGGATA TCATTGCCAT 
CAAGTTCGTT GTAGCACGCG 
CGTTCCAAGT GGCTGAGGTG 
CTACAGGATA CCATCAGCGG 
GATA 



550 
600 
650 
700 
750 
800 
850 
884 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO : 858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double , 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 10 0 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 200 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 300 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 350 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 4 00 

ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 450 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACACTATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 700 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



2) INFORMATION FOR SEQ ID NO: 859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 55 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

481 
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(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

5 TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 2 00 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 2 50 

10 CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 3 00 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

TGCAAGGGCG GCAAGATCGG CCTGTTCGGC GGTGCCGGTG TGGGCAAGAC 400 

TGTGAT CATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 4 50 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

15 TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 

GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

20 ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 90 0 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 95 0 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

25 CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAGC 110 0 

GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT 115 0 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 120 0 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 1250 

30 TCGTA 1255 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 860 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial mexicetna. 
<B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 860 



CTCGGAGGGC 
TTGGCCGCGA 
AACACCGGCC 
GTCGAAGGTT 
AGACGCTGGG 
GGCCCCGTGG 
GCTGGCGGAT 
TGATCGACCT 
GGCGGCGCCG 
CGTCGCGAAG 
GCACGCGCGA 



GTCCCGCCCG 
TGAGCCGCTG 
GCTGCATTGC 
GTGTCGACCG 
CCGCATCTTC 
GTGAGAAGAT 
CAGGCCGCGG 
GATTCTGCCC 
GTGTGGGCAA 
GGCCACGGTG 
GGGCACGGAC 



TGCTGACGGC GCTGGATGTG 
ACGCTGGAGA TCGTGCAGCA 
GATGCAGACG ACGGACCTGC 
GCGGCAACAT CTCTGTGCCG 
AACGTGCTGG GCGACGCGAT 
GCGCATGGCG ATCCACGCCG 
AGGACACGAT CCTGACGACC 
TACTGCAAGG GTGGCAAGAT 
GACCGTGATC ATCATGGAGC 
GTTTCTCGGT GTTTGCCGGC 
CTGTACCTGG AGATGATGCA 
482 



ACGGAGGACC 50 

CCTGGACGCG 100 

TGAAGCTGAA 150 

GTGGGCCGTG 20 0 

CGACCAGCGC 250 

AGGCCCCGAA 300 

GGCATCAAGG 350 

CGGCCTGTTT 400 

TGATTAACAA 450 

GTTGGCGAGC 500 

GTCGAAGGTG 550 
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ATTGACCTGA AGGGCGAGTC GAAGTGCGTG 
CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC 
CGGAGTACTT CCGAGACGTG GAGGGCCAGA 
AACATCTTCC GCTTCACGCA GGCGAACTCC 
5 CCGCATTCCG GCCGCCGTGG GCTACCAGCC 
GTATGCTGCA GGAGCGCATC ACGTCGACGA 
GTGCAGGCCG TGTACGTGCC TGCGGATGAT 
GACGACGTTC TCGCACCTGG ACGCGACGAC 
CGGAGTCGGG GATCTACCCT GCCGTGAACC 
10 ATCATGGACC C CGATGTGAT CGACGTGGAC 
TATCGTGCAG ATGCTGACCA AGTACAAGGA 
TGCTTGGTAT CGACGAGCTG AGCGAGGAGG 
GCGCGCAAGG TGACCCGGTT CCTGTCGCAG 
GTTCACGGGC ATGACGGGCC AC 

15 



PCT/CA00/01150 

CTTGTGTACG GGCAGAT&AA * 600 

GCAGTCTGCG CTGACGATGG 650 

ATGTGCTGCT GTTCATCGAC 70 0 

GAGGTCTCTG CGCTGCTGGG 750 

GACGCTTGCG GAGGATCTTG 800 

CGAAGGGGTC GATCACGTCC 850 

ATCACGGATC CGGCGCCCGC 900 

TGTGCTGGAC CGCGCGGTGG 950 

CGCTGGAGTG CGCGTCGCGT 1000 

CACTACAACG TTGCGCAGGA 1050 

GCTGCAGGAT ATCATTGCGG 1100 

ACAAGGTCGT GGTGGACCGC 1150 

CCGTTCCAGG TTGCGGAGGT 12 00 

1222 



2) INFORMATION FOR SEQ ID NO: 861 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 124 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica, 
30 (B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 61 

CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

35 CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAAC AC CG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 2 00 

TTCAACGTTC TGGGCGACGC GATCGAC GAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 3 00 

40 CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 3 50 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG' 4 00 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 5 00 
GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

45 GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 
GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 
GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 
GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 
TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 8 00 

50 ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 
GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 
TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 1000 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

55 CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 12 00 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 1246 



60 
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2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 



GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 10 0 

2 0 GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 200 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 250 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 30 0 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 350 

2 5 CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 400 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 450 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

3 0 CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

3 5 CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

40 CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 1200 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG 1265 



2) INFORMATION FOR SEQ ID NO: 863 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

484 
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ATGGCAAAAG 
GATTGGTCAC 
CGGTGCTGTC 
5 GACGCGGCGC 
CGTTGAGTAC 
GTCACGCTGA 
GGCGCGATCC 
CGAGCACATT 

10 TCCTGAACAA 
GAGATGGAAG 
CACGCCGATC 
GCGACCTGGG 
TACATTCCGA 

15 GGAAGACGTG 
TCGAGCGCGG 
AAGCCGACGG 
GCTGGACCAG 
CCAAGCGTGA 

20 ATCAACCCGC 
AGAGGGTGGC 
ACTTCCGCAC 
GAAATGGTGC 
CCCGATCGCC 

25 GTACCGTCGG 



' GCAAGTTTGA 
GTTGACCACG 
GAACAAGTTC 
CGGAAGAGAA 
GAGACGGAGA 
CTACGTGAAG 
TGGTGGTGTC 
TTGCTGTCGC 
GGCGGACATG 
TCCGCGAACT 
GTGAATGGTG 
CGAGCAGGCG 
CGCCGGAGCG 
TTCTCGATCT 
CGTGGTGAAG 
TGAAGACGAC 
GGCCAGGCGG 
AGACGTCGAG 
ACACGGACTT 
CGTCACACGC 
GACGGACGTG 
TGCCGGGCGA 
ATGGAAGAAG 
TGCCGGCGTC 



ACGTACCAAG 
GCAAAACGAC 
GGCGGCGAGG 
GGCGCGTGGG 
CGCGTCACTA 
AACATGATCA 
GGCCGCAGAC 
GCCAGGTTGG 
GTTGATGACG 
GCTGAGCAAG 
CGGCCAAGCT 
ATTCTGTCGC 
CGCGGTCGAC 
CGGGCCGTGG 
GTTGGCGAGG 
CTGCACGGGC 
GCGACAACGT 
CGTGGCCAGG 
CACGGCCGAG 
CGTTCTTCAA 
ACCGGCACGA 
CAACGTGTCG 
GTCTGCGTTT 
GTCGCCAAGA 



CCGCACGTGA 
GTTGACGGCG 
CTCGCGGCTA 
ATCACGATCA 
CGCGCACGTT 
CGGGTGCTGC 
GGCCCGATGC 
CGTGCCGTAC 
CGGAGCTGCT 
TACGATTTCC 
GGCGCTGGAA 
TGGCGCAAGC 
GGTGCGTTCC 
CACGGTGGTG 
AAATCGAAAT 
GTGGAGATGT 
GGGTATCTTG 
TGCTGGCCAA 
GTGTACATTC 
CGGCT AT CGT 
TCGACCTGCC 
ATGACCGTCA 
CGCCATCCGT 
TCATCAAGTA 



ACGTGGGTAC 50' 

GCGATCACGA 100 

CGACCAGATT 150 

ACACCTCGCA 200 

GATTGCCCGG 250 

GCAGATGGAC 300 

CGCAGACGCG 350 

ATCATCGTGT 400 

CGAGCTGGTG 450 

CGGGCGATGA 500 

AGCGACAACG 550 

GCTGGACACG 600 

TGATGCCGGT 650 

ACTGGCCGTA 70 0 

CGTGGGCATC 750 

TCCGCAAGCT 800 

CTGCGCGGCA 850 

GCCGGGTTCG 900 

TGTCCAAGGA 950 

CCGCAGTTCT 1000 

GGCGGACAAG 1050 

AGCTGCTGGC 1100 

GAAGGCGGTC 115 0 

A 1191 



30 



2 ) INFORMATION FOR SEQ ID NO; 864 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: LVH/ 75/ USAMRU- K/ 1 8 

(C) ACCESSION NUMBER: extracted from U10562 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 2 00 

50 CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 300 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

55 TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 7 00 

60 TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

485 
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CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGTGAC 800> 

GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 

ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG CTTTAACGTG 900 

AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

5 CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGATCA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 10 50 

TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

CGACCGTCGC TCTGGCAAGG AGCTGGAGAA GGCTCCCAAG TCGATCAAGT 1150 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 120 0 

10 GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 13 0 0 

GTTCTGGTGG TAAGGTGACG AAGGCTGCGG TGAAGGCTTC GAAGAAATAA 1350 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 865 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1052 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG CTGAGAGAGA AAGAGGTATT ACCATCGATA TTGCTTTATG 50 

GCAATTCGAA ACCCCAAAAT ACCACTACAC TGTCATTGAT GCCCCAGGTC 100 

ACAGAGATTT CATCAAGAAT ATGATTACTG GTACCT CTCA AGCTGATGTT 150 

GCTTTATTGG TTGTCCCAGC CGATCGTTTC GAAGGTGCCT TCTCCAAGGA 200 

35 AGGTCAAACC AGAGAACATG CTTTATTGGC CTTCACTTTG GGTGTCAGAC 250 

AAATGATTGT CGGTATTAAC AAGATGGATA CCTGTGAATA CAAGCAATCT 3 00 

CGTTTTGATG AAATCTTCAA CGAAGTTGAT GGTTACCTCA AGAAGGTTGG 350 

TTACAACACC GAGAAGATCC CATTCGTTGC CATTTCTGGT TTCGTTGGTG 4 00 

ATAATATGGT TGAGAGATCT GACAAGATGC CATGGTATAA GGGTAAGACC 450 

40 TTAGTCGAAG CCCTCGACAC TATGGAACCA CCAAAGAGAC CAACTGACAA 500 

GCCACTCCGT CTCCCATTAC AAGATGTTTA CAAGATAGGT GGTGTAGGTA 550 

CTGTCCCAGT CGGTCGTGTT GAGACTGGTA TCATCAGACC AGGTATGAAT 60 0 

GTTACCTTCG CTCCAGCTGG TGTTACCACT GAAGTTAAGT CAGTAGAAAT 650 

GCACCATGAG CAGATGCCAG AGGCCGTCCC AGGTGACAAC GTTGGTTTCA 700 

45 ATGTTAAGAA CGTCTCCATC AAGGATATCA AGAGAGGTTT CGTTGCTTCT 750 

GATGCCAAGA ATGACCCAGC TAAGGGCTGT GAAGACTTCA CTGCTCAAGT 800 

TATCGTCCTC AACCACCCAG GTGAAATCAA GAACGGTTAC TCTCCAGTCG 850 

TTGACTGTCA CACCGCTCAC ATTTCCTGCA AATTCCAGAC TATCACTGCT 900 

AAGATGGACA AGAGATCTGG TAAGGTTTTG GAAGAAAACC CAAAGCTTAT 950 

50 CAAGTCTGGT GATGCTGCTT TGGTTGTTAT GCAACCTTTG AAGCCACTTT 1000 

GTGTTGAGGC CTTCACTGAC TACCCACCTC TAGGTCGTTT CGCTGTCCGT 1050 

GA 1052 



55 

2) INFORMATION FOR SEQ ID NO: 866 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 7 bases 
60 (B) TYPE: Nucleic acid 

486 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

15 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 2 00 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 3 50 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

2 0 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

25 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 8 00 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



30 
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2) INFORMATION FOR SEQ ID NO : 867 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867 



AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 50 

ACAACAGCAG CTGGGCGACG GCATTGTCCG TACCATTGCA CTGGGTACCT 100 

50 CCGACGGCCT GCGTCGCGGC ATGATGATCC AGAACACCGG CAAACCTATC 150 

ATGGTGCCAG TCGGTAAAGC AACCCTGGGT CGCATCATGG ACGTGCTGGG 200 

TAACCCGATC GACGAATGCG GCGCGGTCGC TCACGACCAG ATCGCTTCGA 250 

TCCACCGCGC TCCTCCTGCG TACGACGAAC TGTCGCCATC GCAAGATCTG 3 00 

CTGGAAACCG GCATTAAAGT TATTGACCTG GTGTGCCCGT TCGCCAAGGG 350 

55 CGGTAAAGTC GGTCTGTTCG GCGGTGCAGG TGTGGGCAAG ACCGTGAACA 400 

TGATGGAACT GATCAACAAC ATCGCCAAAG CACACTCGGG TCTGTCCGTG 450 

TTTGCCGGTG TGGGTGAGCG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

GATGGCTGAC GCCAAAGTGG TCGATCTGGA AAATCCAGAG AACTCCAAGG 550 

TTGCGATGGT CTACGGTCAG ATGAATGAAC CACCAGGCAA CCGTCTGCGC 600 

60 GTGGCGCTGA CCGGTCTGAC CATGGCTGAA GCATTCCGTG ACGAAGGCAA 650 

487 
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AGACGTTCTG TTCTTCGTGG ACAACATCTA CCTgCTTCACC CTGGCCGC&K " ^OCT 

CCGAAGTATC GGCACTGCTG GGCCGTATGC CATCGGCTGT GGGTTACCAG 750 

CCTACGCTGG CCGAAGAAAT GGGTCGCCTG CAAGAGCGCA TCACTTCGAC 800 

CAAGACCGGT TCGATCAC 818 



2) INFORMATION FOR SEQ ID NO: 868 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
20 (B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868 

CTATCTTAGT AGTATCTGCT GCTGATGGCC CAATGCCACA AACTCGTGAA 50 

25 CACATTCTTT TATCACGTAA CGTTGGTGTT CCAGCATTAG TTGTATTCTT 100 

AAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTAGAAA 150 

TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG TGACGATGTA 2 00 

CCTGTAATCT CTGGTTCTGC ATTAAAAGCT TTAGAAGGCG ACGCTGACTA 250 

TGAGCAAAAA ATCTTAGACT TAATGCAAGC TGTTGATGAC TTCATTCCAA 300 

30 CACCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG TTGAACGTGG 400 

TCAAATCAAA GTCGGTGAAG AAATCGAAAT CATCGGTATG CAAGAAGAAT 450 

CAAGCAAAAC AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC 500 

TACGCTGAAG CTGGTGACAA CATTGGTGCA TTATTACGTG GTGTTTCACG 550 

35 TGATGACGTA CAACGTGGTC AAGTTTTAGC TGCTCCTGGT ACTATTACAC 60 0 

CACATACAAA ATTCAAAGCG GATGTTTACG TTTTATCTAA AGATGAAGGT 650 

GGTCGTCATA CACCATTCTT CACTAACTAC CGCCCACAAT TCTATTTCCG 700 

TACTACTGAC GTAACTGGTG TTGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TTATGC CTGG CGATAACGTT GAAATGGA 778 

40 

2) INFORMATION FOR SEQ ID NO: 869 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Etoterococcus casseliflavus 
55 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 
60 GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC 100 
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GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACT TATTGT&AGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGCGATGCT TCATACGAAG AAAAAATCAT GGAATTAATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGACA CTGACAAACC 300 

5 ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 4 00 

GAAATCGTTG GTATTGCTGA AGAAACTGCT AAAACAACTG TAACTGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGTGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

10 TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTTA AAGCTGAAGT 600 

TTACGTTTTA ACAAAAGAAG AAGGTGGACG TCACACTCCA 640 



15 2) INFORMATION FOR SEQ ID NO : 870 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 



GTCCTATGCC TCAAACACGT GAACACATCT TGTTATCACG TAACGTTGGT 50 

GTACCATACA TCGTTGTTTT CTTAAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTACTA GAATTAGTTG AAATGGAAGT TCGTGACTTA TTGTCAGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCTGGTTC TGCTTTGAAA 200 

35 GCTCTTGAAG GCGATGCTTC ATACGAAGAA AAAATCATGG AATTAATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGACACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ACGAAGTTGA 4 00 

AATCGTTGGT ATTGCTGAAG AAACTGCTAA AACAACTGTA ACTGGTGTTG 450 

40 AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGTGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTTAAA GCTGAAGTTT 600 

ACGTTTTAAC AAAAGAAGAA GGTGGACGTC ACACACCATT CTTC 644 



2 ) INFORMATION FOR SEQ ID NO: 871 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 637 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R758 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871 

TCCTATGCCT CAAACACGTG AACACATCTT GTTATCACGT AACGTTGGTG 50 

TACCATACAT CGTTGTTTTC TTAAACAAAA TGGATATGGT TGATGACGAA 100 

5 GAATTACTAG AATTAGTTGA AATGGAAGTT CGTGACTTAT TGTCAGAATA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCTGGTTCT GCTTTGAAAG 2 00 

CTCTTGAAGG CGATGCTTCA TACGAAGAAA AAATCATGGA ATTAATGGCT 250 

GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGACACTG ACAAACCATT 3 00 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

10 CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA CGAAGTTGAA 400 

ATCGTTGGTA TTGCTGAAGA AACTGCTAAA ACAACTGTAA CTGGTGTTGA 450 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500 

CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG * ACAAGTATTA 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTTAAAG CTGAAGTTTA 600 

15 CGTTTTAACA AAAGAAGAAG GTGGACGTCA CACTCCA 637 



20 



50 



60 



2) INFORMATION FOR SEQ ID NO: 872 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

35 GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

40 TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

45 GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG * GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 60 0 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 643 



2) INFORMATION FOR SEQ ID NO: 873 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

490 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873 

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

t 10 ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

15 AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT C 641 

20 

2) INFORMATION FOR SEQ ID NO: 874 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
35 (B) STRAIN: LSPQ 2514 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

40 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

45 TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA GAAGTTGAAA 3 50 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

50 TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTT ATGCCTGGCG A 681 



2) INFORMATION FOR SEQ ID NO: 875 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 675 bases 
60 (B) TYPE: Nucleic acid 
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STRANDEDNESS : Double 
TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: R591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 



ATTATCACGT 
TTGACATGGT 
CGTGACTTAT 
CGCTGGTTCT 
AAATCTTAGA 
CGTGATTCTG 
CACTGGTCGT 
AAGTTGGTGA 
ACTGTTACTG 
TGGTGACAAC 
AACGTGGTCA 
TTCAAAGCTG 
TCCATTCTTC 
TAACTGGTGT 



AACGTTGGTG 
AGACGACGAA 
TAAGCGAATA 
GCATTAAAAG 
CTTAATGCAA 
ACAAACCATT 
GGTACTGTTG 
AGAAGTTGAA 
GTGTAGAAAT 
ATCGGTGCTT 
AGTATTAGCT 
AAGTATACGT 
ACTAACTATC 
TGTAAACTTA 



TACCAGCATT 
GAATTATTAG 
TGACTTCCCA 
CATTAGAAGG 
GCAGTTGATG 
CATGATGCCA 
CTACAGGCCG 
ATCATCGGTA 
GTTCCGTAAA 
TATTACGTGG 
GCTCCTGGTT 
ATTATCTAAA 
GCCCACAATT 
CCAGA 



AGTTGTATTC 
AATTAGTTGA 
GGTGACGATG 
CGATGCTGAA 
ATTACATTCC 
GTTGAGGACG 
TGTTGAACGT 
TGCACGAAAC 
TTATTAGACT 
TGTTGCACGT 
CTATTACACC 
GATGAAGGTG 
CTATTTCCGT 



TTAAACAAAG 50 

AATGGAAGTT 100 

TACCTGTAAT 150 

TACGAACAAA 2 00 

AACTCCAGAA 250 

TATTCTCAAT 3 00 

GGTCAAATCA 350 

TTCTAAAACA 4 00 

ACGCTGAAGC 450 

GAAGACGTAC 500 

ACACACAAAA 550 

GACGTCACAC 600 

ACTACTGACG 650 

675 
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2) INFORMATION FOR SEQ ID NO: 8 76 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 

45 TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50 

AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 200 

CAAAAAATCT TAGACTTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 25 0 

50 AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 3 00 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTATGCACG AAACTTCTAA 400 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATCGGT GCTTTATTAC GTGGTGTTGC ACGTGAAGAC 500 

55 GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCTATTA CACCACACAC 550 

AAAATTCAAA GCTGAAGTAT ACGTATTATC TAAAGATGAA GGTGGACGTC 600 

ACACTCCATT CTTCACTAAC TATCGCCCAC AATTCTATTT CCGTACTACT 650 
GACGTAACTG GTGTTGTAAA CTTACCAGAA GGTACAGAAA TGGTTATGCC 
TGGC 



60 



700 
704 
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2) INFORMATION FOR SEQ ID NO: 877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877 

TCTTAGTTGT ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC 50 

ATCTTATTAT CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA 100 

CAAAGTTGAC ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTTCGTGA CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT 2 00 

GTAATCGCTG GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA 250 

ACAAAAAATC TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC 300 

CAGAACGTGA TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC 3 50 

TCAATCACTG GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA 400 

AATCAAAGTT GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA 450 

AAACAACTGT TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT 50 0 

GAAGCTGGTG ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA 550 

CGTACAACGT GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA 600 

CAAAATTCAA AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT 650 

CACACTCCAT TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC 700 

TGACGTAACT GGTGTTGTAA ACTTACCAGA AGGTACAGAA ATGGTTATGC 750 

CTGGCGACAA CGTTGAAATG 77 0 



2 ) INFORMATION FOR SEQ ID NO: 878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA ATTATTAGAA 50 

TTAGTTGAAA TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG 100 

TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA TTAGAAGGCG 150 

ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC AGTTGATGAT 200 

TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT 250 

TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG 300 

TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT CATCGGTATG 350 
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CACGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT TCCGTAAATT 40 0 

ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA TTACGTGGTG 450 

TTGCACGTGA AGACGTACAA CGTGGTCAAG TATTAGCTGC TCCTGGTTCT 500 

ATTACACCAC ACACAAAATT CAAAGCTGAA GTATACGTAT TATCTAAAGA 550 

TGAAGGTGGA CGTCACACTC CATTCTTCAC TAACTATCGC CCACAATTCT 600 

ATTTCCGTAC TACTGACGTA ACTGGTGTTG TAAACTTACC AGAAGGTACA 650 

GAAATGGTTA TGCCTGGCGA CAACGTTGAA ATGACAGTTG AATTAATCGC 70 0 

TCCAATCGCT ATCGAA 716 



2) INFORMATION FOR SEQ ID NO: 879 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 64 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 



CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

3 0 GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

35 TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

40 TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 640 



2) INFORMATION FOR SEQ ID NO : 880 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

60 CGGCGCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 50 
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GCGAGCACAT CCTGCTGTCC CGCCAGGTAG 
TTCCTGAACA * AGGCCGACAT GGTCGACGAC 
CGAGATGGAA GTTCGCGATC TGCTGAACAC 
ACACTCCGAT CATCATCGGT TCCGCGCTGA 
5 GACAACGGCA TCGGCGTAAG CGCCGTGCAG 
CTCCTACATT CCGGAGCCGG TTCGTGCCAT 
CGATCGAAGA CGTGTTCTCG ATCTGCGGCC 
CGTGTAGAGC GCGGCATCAT CAAGGTCCAG 
CATCAAGGCG ACCACCAAGA CTACCTGCAC 

10 AGCTGCTCGA CGAAGGTCGT GCTGGTGAGA 
GGCACCAAGC GTGAAGACGT AGAGCGTGGC 
CACCATCAAG CCGCACACCA AGTTCGAGTG 
AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT 
TTCTACTTCC GTACCACCGA YGTGACCGGT 

15 CGTAGAGATG GTAATGCCGG GCGACAACAT 
TCGCTCCGAT CGCCATGGAA GATGGCCTGC 



PCT/CA00/01150 

GCGTTCCCTA CATCGTC&TG 100 

GCCGAGCTGC TGGAACTGGT 150 

CTACGACTTC CCGGGCGACG 200 

TGGCGCTGGA AGGCAAGGAT 250 

AAGCTGGTAG AGACCCTGGA 3 00 

CGACCAGCCG TTCCTGATGC 3 50 

GCGGTACCGT GGTAACCGGT 400 

GAAGAAGTGG AAATCGTCGG 4 50 

CGGCGTTGAA ATGTTCCGCA 500 

ACGTTGGTAT CCTGCTGCGT 550 

CAGGTTCTGG CCAAGCCGGG 600 

CGAAGTGTAC GTGCTGTCCA 650 

TCAAGGGCTA CCGTCCGCAG 700 

AACTGCGAAC TGCCGGAAGG 750 

CAAGATGGTT GTCACCCTGA 80 0 

G 831 



20 2) INFORMATION FOR SEQ ID NO: 881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R421 



35 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 



CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 2 00 

40 TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 3 00 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

4 5 ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 



2) INFORMATION FOR SEQ ID NO: 882 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterococcus casseli flavus 

(B) STRAIN: R775 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882 

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 10 0 

10 AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

15 TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 4 00 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 50 0 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

20 GTTTTAACAA AAGAAGAAGG TGGACGTCAT ACACCA 63 6 



25 



55 



2) INFORMATION FOR SEQ ID NO: 883 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R422 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

40 GGTCCTATGC CTCAAACACG TGAACATATC TTATTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTAT TCTTAAACAA AATGGATATG GTTGATGACG 10 0 

AAGAATTATT AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCAGAA 150 

TACGATTTCC CAGGCGATGA TGTTCCAGTT ATCGCAGGTT CTGCTTTGAA 2 00 

AGCTTTAGAA GGCGACGAGT CTTATGAAGA AAAAATCTTA GAATTAATGG 250 

45 CTGCAGTTGA CGAATATATC CCAACTCCAG AACGTGATAC TGACAAACCA 3 00 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 350 

TGCTACAGGC CGTGTTGAAC GTGGTGAAGT TCGCGTTGGT GACGAAGTTG 4 00 

AAATCGTTGG TATTAAAGAC GAAACATCTA AAACAACTGT TACAGGTGTT 450 

GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCAGGCG ACAACATCGG 50 0 

50 TGCTTTATTA CGTGGTGTAG CACGTGAAGA TATCGAACGT GGACAAGTAT 550 

TAGCTAAACC AGCTACAATC ACTCCACACA CAAAATTCAA AGCTGAAGTA 60 0 

TACGTATTAT CAAAAGAAGA AGGCGGACGT CACACTCCAT T 641 



2) INFORMATION FOR SEQ ID NO: 884 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 0 bases 
60 (B) TYPE: Nucleic acid 

496 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R575 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884 



TATGCCTCAA 
CATACATCGT 
TTATTAGAAT 

15 TTTCCCAGGC 
TAGAAGGCGA 
GTTGACGAAT 
GATGCCAGTC 
CAGGCCGTGT 

2 0 GTTGGTATTA 
GTTCCGTAAA 
TATTACGTGG 
AAACCAGCTA 
ATTATCAAAA 

25 



ACACGTGAAC 
TGTATTCTTA 
TAGTAGAAAT 
GATGATGTTC 
CGAGTCTTAT 
ATATCCCAAC 
GAAGACGTAT 
TGAACGTGGT 
AAGACGAAAC 
TTATTAGACT 
TGTTGCACGT 
CAATCACTCC 
GAAGAAGGCG 



ATATCTTATT 
AACAAAATGG 
GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
TCCAGAACGT 
TCTCAATCAC 
GAAGTTCGCG 
ATCTAAAACA 
ACGCTGAAGC 
GAAGATATCG 
ACACACAAAA 
GACGTCACAC 



ATCACGTAAC 
ATATGGTTGA 
GACTTATTAT 
AGGTTCTGCT 
TCTTAGAATT 
GATACTGACA 
TGGACGTGGT 
TTGGTGACGA 
ACTGTTACAG 
AGGCGACAAC 
AACGTGGACA 
TTCAAAGCTG 
TCCATTCTTC 



GTTGGTGTAC 50 

TGACGAAGAA 10 0 

CAGAATACGA 150 

TTGAAAGCTT 200 

AATGGCTGCA 250 

AACCATTCAT 3 00 

ACTGTTGCTA 350 

AGTTGAAATC 4 00 

GTGTTGAAAT 450 

ATCGGTGCTT 500 

AGTATTAGCT 550 

AAGTATACGT 600 

640. 



2) INFORMATION FOR SEQ ID NO: 885 

30. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 885 

TGCCTCAAAC TCGTGAACAC ATCCTATTGT CTCGTCAAGT TGGTGTTCCT 50 

45 TACATCGTTG TATTCTTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 100 

ACTAGAATTA GTTGAAATGG AAGTTCGTGA CCTATTAACA GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 2 00 

GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 250 

TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATT CATGA 30 0 

50 TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 35 0 

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 40 0 

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 450 

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 500 

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550 

55 ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600 

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



60 2 ) INFORMATION FOR SEQ ID NO: 886 

497 



SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R576 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886 



CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 150 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 200 

2 0 AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 250 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 300 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATTACTGGA CGTGGTACTG 350 

TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 450 

25 TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCTGGA GACAACATTG 500 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGCTAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

ATACGTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 640 



2) INFORMATION FOR SEQ ID NO : 887 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramlgera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887 



ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

CATCCTGCTG GCCCGCCAAG TTGGCGTTCC ATACATCATC GTGTTCCTGA 100 

50 ACAAGTGCGA CCTGGTTGAC GACGCAGAAC TGCTGGAACT GGTCGAAATG 150 

GAAGTGCGTG AATTGCTGTC GAAATACGAG TTCCCAGGCG ACGACGTACC 200 

AATCATCAAG GGTTCGGCAC GTATGGCGCT GGAAGGCAAA GAAGGCGAGA 250 

TGGGCGTTGA CGCCATCATG CGTCTGGCCG ATGCACTGGA CAGCTACATC 3 00 

CCTACGCCAG AGCGCGCAGT CGATGGCGCC TTCCTGATGC CAGTGGAAGA 350 

55 CGTGTTCTCG ATCTCGGGTC GCGGTACCGT TGTGACCGGT CGTATCGAGC 400 

GCGGCGTGAT CAAGGTCGGC GAAGAGATCG AAATCGTCGG CATTATCGAC 450 

ACCGTCAAAA CCACTTGCAC CGGCGTGGAA ATGTTCCGCA AGCTGCTGGA 500 

CCAGGGTCAA GCCGGCGACA ACGTTGGTCT GCTGCTGCGC GGCACCAAGC 550 

GTGAAGACGT ACAGCGTGGT CAGGTTCTGG CCAAGCCAGC GTCGATCAAG 600 

60 CCGCACAACC ACTTCACCGG CGAGATCTAC GTTCTGTCGA AAGATGAAGG 650 

498 
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CGGCCGTCAC ACCCCGTTCT 
GTACGACTGA CGTGACCGGT 
GTCATGCCAG GCGACAACGT 
CGCGAT 



TCAACAACTA TCGTCCACAG 
TCGATCGAAC TGCCAGCAGA 
GTCGATCACC GTCAAGCTGA 



TTCTACTTCC 7 00" 

CAAAGAAATG 750 
TCAACCCGAT 800 

806 



2 ) INFORMATION FOR SEQ ID NO : 888 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN : R503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888 



25 



30 



35 



TATGCCTCAA 
CATACATCGT 
TTATTAGAAT 
TTTCCCAGGC 
TAGAAGGCGA 
GTTGACGAAT 
GATGCCAGTC 
CAGGCCGTGT 
GTTGGTATTA 
GTTCCGTAAA 
TATTACGTGG 
AAACCAGCTA 
ATTATCAAAA 



ACACGTGAAC 
TGTATTCTTA 
TAGTAGAAAT 
GATGATGTTC 
CGAGTCTTAT 
ATATCCCAAC 
GAAGACGTAT 
TGAACGTGGT 
AAGACGAAAC 
TTATTAGACT 
TGTAGCACGT 
CAATCACTCC 
GAAGAAGGCG 



ATATCTTATT 
AACAAAATGG 
GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
TCCAGAACGT 
TCTCAATCAC 
GAAGTTCGCG 
ATCTAAAACA 
ACGCTGAAGC 
GAAGATATCG 
ACACACAAAA 
GACGTCACAC 



ATCACGTAAC 
ATATGGTTGA 
GACTTATTAT 

AGGTTCTGCT 
TCTTAGAATT 
GATACTGACA 
TGGACGTGGT 
TTGGTGACGA 
ACTGTTACAG 
AGGCGACAAC 
AACGTGGACA 
TTCAAAGCTG 
TCCA 



GTTGGTGTAC 50 

TGACGAAGAA 10 0 

CAGAATACGA 150 

TTGAAAGCTT 2 00 

AATGGCTGCA 250 

AACCATTCAT 3 00 

ACTGTTGCTA 3 50 

AGTTGAAATC 400 

GTGTTGAAAT 450 

ATCGGTGCTT 500 

AGTATTAGCT 550 

AAGTATACGT 600 

634 



40 2 ) INFORMATION FOR SEQ ID NO : 889 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 93 bases 
(B> TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889 



TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 

TTCGCTCGAT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 

GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 150 

GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 200 

60 AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 250 

499 



SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



GGTGCCCGTG CCCGTGTCGC CCTTACCGGT 
CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 
GTTTCACCCA GGCCGGTTCT GAGGTGTCTG 
TCTGCCGTCG GTTACCAGCC CACCCTGGCC 
5 GGAGCGTATC ACCACCACCA AGAAGGGTTC 



CTGACCATTG CCGAGTACTT 3 00 

CTTCATTGAC AACATTTTCC 350 

CCCTTCTCGG TCGTATCCCC 400 

GTCGACATGG GTGGTATGCA 450 

TATTACCTCC GTC 493 



2) INFORMATION FOR SEQ ID NO: 8 90 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 90 

2 5 TTGATTGTAC GTCTTTACCT TTCTGCCTGA CTGTTTACGA CAACTAACGA 50 

AAGCGTAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC TGTCTTCACT 100 

GGTGTCGGTG AACGTACTCG TGAGGGTAAC GATTTGTACC ACGAAATGCA 150 

GGAAACTGGT GTCATTCAGC TCGAGGGTGA ATCCAAGGTC GCCCTCGTGT 2 00 

TCGGTCAGAT GAACGAGCCC CCCGGTGCCC GTGCCCGTGT CGCTCTTACT 250 

3 0 GGTTTGACCA TTGCCGAGTA CTTCCGTGAC GAGGAAGGTC AGGACGTGCT 300 

TCTCTTCATT GACAACATTT TCCGTTTCAC TCAGGCCGGT TCTGAGGTGT 3 50 

CTGCCCTTCT GGGTCGTATC CCCTCTGCCG TCGGTTACCA GCCCACCCTT 400 

GCCGTCGACA TGGGTATCAT GCAGGAGCGT ATTACCACCA CCACCAAGGG 450 

TTCCATCACC TCCGTC 466 

35 



2) INFORMATION FOR SEQ ID NO: 891 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
50 (B) STRAIN: ATCC 42570 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891 

AGGAGCTGAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

55 ACTGGTGTCG GTGAGCGTAC CCGTGAGGGT AACGATCTGT ACCACGAAAT 100 

GCAGGAGACC TCGGTCATTC AGCTCGAGGG CGAGTCTAAG GTGGCCCTGG 150 

TCTTTGGTCA GATGAACGAG CCCCCGGGTG CTCGTGCCCG TGTCGCTCTT 200 

ACTGGTCTTA CCGTCGCCGA GTACTTCCGT GACCAGGAGG GTCAGGATGG 250 

TTAGTTCTCG TCCACTCATG CCGAAACATG TGCGTGTTCC GAGGCTAATC 3 00 

60 AACGTGCCAG TGCTGCTTTT CATCGACAAC ATTTTCCGAT TCACACAGGC 350 

500 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CGGTTCCGAG GTGTCTGCCC TGCTGGGTCG^ IkTCcfcCT 5f ' GCCGTfcGGTT 40~d/ 
ACCAGCCCAC CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 4 50 

ACCACCAAGA AGGGCTCTAT CACCTCCG 473 



2) INFORMATION FOR SEQ ID NO: 892 

(i) SEQUENCE CHARACTERISTICS: 
1° (A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892 



GTCTTTATCC AGGAGTTGAT TGTACGTCTT TACCTTTCTG CCTGACTGTT 50 

TACGACAACT AACGAAAGCG TAGAACAACA TTGCCAAGGC TCACGGTGGT 100 

25 TACTCTGTCT TCACTGGTGT CGGTGAACGT ACTCGTGAGG GTAACGATTT 150 

GTACCACGAA ATGCAGGAAA CTGGTGTCAT TCAGCTCGAG GGTGAATCCA 200 

AGGTCGCCCT CGTGTTCGGT CAGATGAACG AGCCCCCCGG TGCCCGTGCC 250 

CGTGTCGCTC TTACTGGTTT GACCATTGCC GAGTACTTCC GTGACGAGGA 3 00 

AGGTCAGGAC GTGCTTCTCT TCATTGACAA CATTTTCCGT TTCACTCAGG 3 50 

30 CCGGTTCTGA GGTGTCTGCC CTTCTGGGTC GTATCCCCTC TGCCGTCGGT 4 00 

TACCAGCCCA CCCTTGCCGT CGACATGGGT ATCATGCAGG AGCGTATTAC 450 
CACCACCACC AAGGGTTCCA TCACCTCCGT C 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 893 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1208 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Spor ot.hr ix schenckii 

(B) STRAIN: ATCC 14285 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893 

CTCGTCCGTG GTGCTAAGGC CACTGACACT GGTTSCCCCA TTACCATCCS 50 

CGTCGGCCCC GGTACCCTCG GTCGCATCAT GAACGTCACC GGTGACCCGA 100 

TCGACGAGCG CGGTCCCATC AAGACCGACA AGTTCCGTCC CATCCACGCT 150 

55 GAGGCTCCCG AGTTCGTTGA CCAGTCGACC ACCGCTGAGG TTCTCGTGAC 200 

~ T £ TCAAG GTCGTCGATC TGCTTGCTCC CTACGCCCGT GGTGGTAAGA 250 

TTGGTCTGTT TGGCGGTGCC GGTGTTGGCA AGACCGTGTT CATCCAGGAG 3 00 

CTCATCAACA ACATCGCCAA GGCCCACGGT GGTTACTCCG TCTTCACCGG 350 

^^ CGAG CGTACCCGTG AGGGTAACGA TCTGTACCAC GAAATGCAGG 4 00 

60 AGACCTCTGT CATTCAGCTT GACGGTGACT CCAAGGTCGC CCTGGTGTTC 450 

501 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGTCAGATGA ACGAGCCCCC 
TTTGACTGTC GCTGAGTACT 
TTTGAATTAT TTCCTTGTCG 
GTCAGTGCTT CTCTTCATCG 
CTGAGGTGTC TGCCCTTCTG 
CCCACGCTCG CCGTGGACAT 
CCGCAAGGGC TCAATTACCT 
ATCTGACGGA TCCCGCCCCC 
ACTGTGCTGT CCCGAGGTAT 
CCCCCTCGAC TCCAAGTCGC 
ACCACTACGA GACCGCCACT 
TCGCTGCAGG ACATCATCGC 
CGACAAGCTT ACAGTCGAGC 
AGCCGTTCAC GGTCGCGCAG 
GATCTGAAGG ACACTATCGC 
TGACAGCC 



TGGTGCTCGT GCCCGTGTCG 
TCCGTGACGA GGAGGGCCAG 
TACAGTTCCA AATCGAAGAA 
ACAACATTTT CCGCTTCACC 
GGTCGTATTC CCTCCGCTGT 
GGGTCTGATG CAGGAGCGTA 
CCGTGCAGGC CGTCTACGTG 
GCCACCACCT TCGCCCATCT 
CTCTGAGCTG GGTATCTACC 
GTATGCTGGA CCCCCGTATT 
CGCGTCCAGC AGATCCTCCA 
CATTCTGGGT ATGGACGAGC 
GTGCTCGTAA GATCCAGCGT 
GTCTTCACTG GTATCGAAGG 
TTCGTTCAAG GCTATCCTGA 



PCT/CA00/01150 



CCTTGACC!GG 


500 


GATGGTATGT 


5^0 


TTACTAACTT 


600 
\j \j \j 


CAGGCCGGTT 


6R0 


CGGTTACCAG 


70 0 


TTACCACCAC 




CCCGCTGACG 


ft 0 0 
o u u 


GGACGCCACC 


QCA 
O U 


CCGCTGTCGA 


900 
^ \j \j 


GTCGGTGACG 


950 


AGAGTACAAG 


1000 


TGTCTGAGGC 


1050 


TTCCTGAGCC 


1100 


CCAGCTGGTC 


1150 


GCGGTGAGGG 


1200 




1208 



2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 

(Xi) SEQUENCE DESCRIPTION: SEQ 

TTCAGGAACT TATTGTAAGC CGCCCTCTTT 
AAGGCTGACA GGTAATAGAA CAACATTGCC 
CGTGTTCACT GGTGTCGGTG AGCGTACCCG 
ATGAAATGCA GGAAACCCGC GTCATCCAGC 
GCCCTTGTGT TCGGTCAGAT GAACGAGCCC 
CGCTCTTACT GGTCTTACCG TTGCCGAATA 
AAGATGGTAC GCCTTTTTAC TCTTCTTATT 
CTAACCTGCT CCAGTGCTTC TCTTCATTGA 
AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG 
GGTTACCAGC CCACTCTCGC CGTCGACATG 
CACAACCACC AACAAGGGTT CCATTACTTC 



ID NO: 894 



ATGCATTGAG GGTGAATAAG 50 

AAGGCTCACG GTGGTTACTC 100 

TGAAGGAAAC GATCTGTACC 150 

TTGATGGCGA GTCTAAGGTC 2 00 

CCTGGAGCCC GTGCCCGTGT 250 

CTTCCGTGAC GAGGAGGGCC 300 

CTTCGGGTCG GACTACAGAA 35 0 

TAACATTTTC CGTTTCACAC 40 0 

GACGTATTCC CTCTGCCGTC 450 

GGTGGTATGC AGGAACGTAT 500 

CGTG 534 



2) INFORMATION FOR SEQ ID NO: 895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

502 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895 

5 

CAAGGCTCAC GGTGGTTACT CCGTCTTCAC TGGTGTCGGT GAGCGTACCC 50 

GTGAGGGTAA CGATCTGTAC CACGAAATGC AGGAGACCTC GGTCATTCAG 100 

CTCGAGGGCG AGTCTAAGGT GGCCCTGGTC TTTGGTCAGA TGAACGAGCC 150 

CCCGGGTGCT CGTGCCCGTG TCGCTCTTAC TGGTCTTACC GTCGCCGAGT 200 

10 ACTTCCGTGA CCAGGAGGGT CAGGATGGTT AGTTCTCGTC CACTCATGCC 250 

GAAACATGTG CGTGTTCCGA GGCTAATCAA CGTGCCAGTG CTGCTTTTCA 300 

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCCGAGGT GTCTGCCCTG 3 50 

CTGGGTCGTA TCCCCTCTGC CGTCGGTTAC CAGCCCACCC TCGCCGTCGA 40 0 

CATGGGTGGC ATGCAGGAGC GTATCACCAC CACCAAGAAG GGCTCTAT 44 8 

15 



2) INFORMATION FOR SEQ ID NO: 896 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 
30 (B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 

ATTCAGGAGC TGATTGTAAG TTGCCAATCC ATGAACTGGA GATTTGGTGT 50 

35 GACCCATAGA ACTAACAAAT TATTTAGAAC AACATCGCCA AGGCTCACGG 100 

TGGTTACTCC GTCTTCTGTG GTGTCGGTGA GCGTACTCGT GAGGGTAACG 150 

ATCTGTACCA CGAAATGCAG GAGACTGGTG TCATCCAGCT CGAGGGTGAC 2 00 

TCCAAGGTCG CTCTGGTCTT CGGTCAGATG AACGAGCCCC CGGGTGCCCG 250 

TGCCCGTGTC GCCCTTACCG GTCTGACCAT TGCCGAGTAC TTCCGTGACG 300 

40 AGGAGGGTCA GGACGTGCTG CTCTTCATTG ACAACATTTT CCGTTTCACC 3 50 

CAGGCCGGTT CTGAGGTGTC TGCC CTTCTG GGTCGTATCC CCTCTGCCGT 400 

CGGTTACCAG CCCACTCTGG CCGTCGACAT GGGTGGTATG CAGGAGCGTA 450 

TTACCACCAC CACCAAGGGT TCCATTACCT CCG 483 

45 

2) INFORMATION FOR SEQ ID NO : 897 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 



60 



503 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 



AAGCTCAAGT 
GAAGTTCCAG 
5 CGCCCCGATA 
GGTCACCGTG 
CTGCGCTATC 
TCTCCAAGGA 
GGTGTCAAGC 

10 GTCCGAGGAT 
AGAAGGTCGG 
TTCAACGGTG 
GGGATGGGAG 
TCGAGGCCAT 

15 CTCCGTCTTC 
CCCTGTCGGC 
CCTTCGCCCC 
CACCAGCAGC 
CAAGAACGTT 

20 CCAAGAACGA 
GTCCTCAACC 
CTGCCACACT 
TTGACCGCCG 
TCCGGTGATG 

25 CGAGTCCTTC 



CCGAGCGTGA 
ACTCCCAAGT 
TGTTTTGGTG 
ACTTCATCAA 
CTCATCATTG 
TGGCCAGACC 
AGCTCATCGT 
CGTTACAACG 
CTACAACCCC 
ACAACATGCT 
AAGGAGAC C A 
CGACGCCATT 
CCCTCCAGGA 
CGTGTCGAGA 
CGCCAACGTC 
TCCAGGAGGG 
TCCGTCAAGG 
TCCCCCTCAG 
ACCCCGGTCA 
GCCCACATTG 
TACCGGCAAG 
CCGCCATCGT 
ACTGACTAC C 



GCGTGGTATC 
ATGAGGTCAC 
CTGTAGCTAA 
GAACATGATC 
CCTCCGGTAC 
CGTGAGCACG 
CGCCCTCAAC 
AAATTGTCAA 
AAGGCCGTTC 
TGAGCCCTCC 
AGGCCGGCAA 
GAGCCCCCTG 
TGTCTACAAG 
CCGGTATCAT 
ACCACTGAAG 
TGTCCCCGGT 
AAGTC CGCCG 
GGTGCTGCCT 
GGTCGGCGCT 
CTTGCAAGTT 
TCTGTTGAGA 
GAAGATGGTT 
CCCC 



ACCATCGACA 
TGTCATCGGT 
CACGATCTGA 
ACTGGTACCT 
TGGTGAGTTC 
CTCTGCTGGC 
AAGATGGACA 
GGAAACCTCC 
CCTTCGTCCC 
TCCAACTGCC 
GGTCACTGGT 
TCCGTCCCTY 
ATCTCTGGTA 
CAAGCCCGGC 
TCAAGTCCGT 
GACAACGTCG 
TGGTAACGTC 
CCTTCAACGC 
GGTTACGCCC 
CTCTGAGCTG 
ACAACCCCAA 
CCTTCCAAGC 



TTGCCCTCTG 50 

AAGCTCGACT 100 

AGATGCCCCC 150 

CCCAGGCTGA 2 00 

GAGGCTGGTA 2 50 

TTTCACCCTC 3 00 

CCTGCAAGTG 350 

AACTTCATCA 4 00 

CATCTCTGGC 450 

CCTGGTACAA 500 

AAGACCCTCA 550 

CAACAAGCCC 600 

TCGGAACGGT "650 

ATGGTCGTCA 700 

CGAGATGCAC 750 

GTTTCAACGT 800 

TGCGGTGACT 850 

CCAGGTCATC 900 

CCGTCCTCGA 950 

CTTGAGAAGA 10 00 

GTTCATCAAG 1050 

CCATGTGTGT 1100 
1124 



2 ) INFORMATION FOR SEQ ID NO: 898 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 63 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Penicillium marneffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 98 

45 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT 150 
ATTCTCATCA TTGCCTCTGG TACTGGTGAA TTCGAGGCTG GTATCTCCAA * 2 00 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

50 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 3 00 

GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 4 00 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 450 

CGGTGACAAC ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

55 GGGAGAAGGA GACCAAGGCC GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCCTCGACT ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 700 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

60 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 800 

504 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGTGCTAACT TGACACTCAG CGCTCCCGCC 
GAGTGTTGAA ATGCACCACC AGCAGCTCAC 
ACGTTGGTTT CAACGTGAAG AACGTCTCCG 
AACGTTGCTG GTGACAGCAA GAACGACCCC 
5 CAACGCCCAG GTCATCGTCC TCAACCACCC 
ACGCCCCAGT CCTCGATTGC CACACTGCCC 
GAGCTCCTCG AGAAGATTGA CCGTCGTACC 
CCCCAAGTTC ATCAAGTCCG GTGACGCTGC 
CCAAGCCCAT GTGTGTTGAG GCTTTCACCG 
10 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG 
CCTTCTCCTC CGTCTTCCAT ATATATTTTT 
ACAAATCACG GGA 



PCT/CA00/01150 

AACGTCACCA CTGAAGTCAA 850 

TGCCGGTCAG CCCGGTGACA 900 

TCAAGGAAAT CCGTCGTGGT 950 

CCTGCCGGTG CTGCCTCCTT 1000 

CGGTCAGGTC GGTGCTGGTT 1050 

ACATTGCTTG CAAGTTCGCT 1100 

GGAAAGTCTG TTGAGGACCA 1150 

CATCGTCAAG ATGATTCCTT 1200 

AGTACCCTCC TCTCGGTCGT 1250 

TTGTCTATTT TCCATCCTTC 13 00 

TCAGTTATAT GTGACTAACC 1350 

1363 



15 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO : 899 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 99 



35 



40 



45 



50 



55 



CAAGCTGAAA 
GGAAGTTCGA 
TATCTCACTC 
GTCACCGTGA 
TGCGCTATCC 
CTCCAAGGAT 
GTGTGCGTCA 
TCTGAGGCCC 
GAAGGTCGGC 
TCAACGGCGA 
GGATGGGAGC 
CGAGGCCATT 
TCCGTCTTCC 
CCTGTCGGCC 
CTTCGCTCCC 
ACGAGCAGCT 
AAGAACGTTT 
CAAGAACGAC 
TCCTCAACCA 
TGCCACACTG 
CGACCGCCGT 
CTGGTGACTC 
GAGGCTTTCA 



GCCGAGCGTG 
GACTCCCAAG 
CTCACAGAAG 
TTTCATCAAG 
TCATTATCGC 
GGCCAGACTC 
GCTCATCGTC 
GTTACCAGGA 
TACAACCCCA 
CAACATGCTT 
GTGAGGTCAA 
GACGGCATTG 
TCTCCAGGAT 
GTATCGAGAC 
GCCAACGTCA 
CACTGAGGGT 
CCGTCAAGGA 
CCCGCTCTGG 
CCCCGGTCAG 
CCCACATTGC 
ACCGGCAAGT 
TGCCATCGTC 
CCGACTACCC 



AGCGTGGTAT 
TACTATGTCA 
CACGCTCCTA 
AACATGATCA 
TGCCGGTACT 
GTGAGCACGC 
GCCATCAACA 
GATCATCAAG 
AGACCGTCGC 
GCCCCCTCCA 
GGGCAACAAG 
AGCCCCCCAA 
GTTTACAAGA 
TGGTGTCCTC 
CCACTGAAGT 
CTTCCCGGTG 
CATCCGCCGT 
GTGCCGCTTC 
GTCGGTGCTG 
TTGCAAGTTC 
CTGTTGAGGA 
AAGATGGTTC 
TCCTCTGGGC 



CACTATCGAC 
CCGTCATTGG 
ACATCACACA 
CTGGTACCTC 
GGTGAGTTCG 
CCTGCTCGCC 
AGATGGACAC 
GAGACCTCCA 
TTTCGTCCCC 
CCAACTGCCC 
CAGACCGGCA 
GCGTCCTTCC 
TCGGTGGTAT 
AAGCCCGGTA 
CAAGTCCGTC 
ACAACGTCGG 
GGTAACGTTG 
TTTCGACGCC 
GTTACGCCCC 
TCCGAGATCA 
CGCCCCCAAG 
CCTCCAAGCC 
CGTTTCGCCG 



ATTGCCCTCT 50 

TACGTCGCAT 100 

GACGCTCCCG 150 

CCAGGCCGAC 2 00 

AGGCTGGTAT 250 

TACACCCTCG 3 00 

CACCAAGTGG 350 

ACTTCATCAA 400 

ATCTCTGGCT 450 

CTGGTACAAG 500 

AGACCCTCCT 550 

GACAAGCCCC 600 

CGGAACTGTT 650 

TGGTCGTTAC 700 

GAGATGCACC 750 

TTTCAACGTG 800 

CCAGTGACTC 850 

CAGGTCATCG 900 

GGTCCTCGAT 950 

AGGAGAAGAT 1000 

TTCATCAAGT 1050 

CATGTGCGTT 1100 

TCCGTGA 1147 



2) INFORMATION FOR SEQ ID NO: 900 
60 (i) SEQUENCE CHARACTERISTICS: 



505 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
10 (B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

15 GTTCGAGACT CCCAAGTACT ATGTCACCGT CATTGGTACG TCGACTCGCG 100 

CGAGACTGGT CGCAATTTCC ACGTCGCTAA CGTG CTTGAA CAGACGCTCC 150 

CGGCCACCGT GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG 2 00 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 300 

2 0 CGGTGTTAAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC C AACTT CATC 4 00 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 450 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGCTGGCA AGTCCACCGG CAAGACCCTC 550 

25 CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT AT CGGCAGAG 650 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 70 0 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTCCG TTGAGATGCA 750 

CCACGAGCAG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

3 0 TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 850 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGCCGAGAT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

3 5 GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



40 2) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 



55. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 



TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

60 GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

506 
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ACCGATAAAC CTCTCCGTCT TCCCCTCCAG 
TATTGGCACT GTTCCTGTCG GACGTGTTGA 
GTATGGTCGT GACCTTCGCT CCCGCCAACG 
GTCGAAATGC ACCACCAGCA GCTTACCGCC 
CGGCTTCAAC GTCAAGAATG TTTCCGTCAA 
TTGCCGGTGA CTCTAAGAAT GATCCCCCAA 
GCCCAGGTCA TCGTCCTCAA CCACCCTGGT 
CCCAGTCCTC GACTGCCATA CTGCCCACAT 
TCCTTGAGAA GATTGATCGA CGAACCGGAA 
AAGTTCATCA AGTCCGGTGA TGCTGCTATC 
G 



PCT/CA00/01150 

GATGTCTACA AGATCTCCGG 300 

GACTGGAGTC ATCAAGCCCG 350 

TCACCACTGA AGTCAAGTCC 4 00 

GGTAACCCCG GTGACAACGT 450 

AGAAGTCCGC CGTGGTAACG 500 

AGGGCTGCGA TTCCTTCAAT 550 

CAGGTTGGCG CTGGTTATGC 600 

TGCCTGCAAA TTCGCTGAGC 650 

AGTCTGTTGA GAACAACCCC .700 

GTCAAGATGA TTCCTTCCAA 750 

751 



2 ) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902 

CGTGAGCGCG GTATCACCAT CGATATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 200 

TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG 2 50 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 300 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 350 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 450 

AAGAGCGGCA AGGTTACCGG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAGATCCGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 900 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 1000 

TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 

CCCCCT " 1056 



2) INFORMATION FOR SEQ ID NO: 903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 bases 

(B) TYPE: Nucleic acid 

507 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DMA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium mameffei 

(B) STRAIN: ATCC 58950 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 

CAAGGCTGAG CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT 50 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

15 TCTCATCATT GCCTCTGGTA CTGGTGAATT CGAGGCTGGT ATCTCCAAGG 200 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 3 00 

TGAGTACTCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 350 

GATCGTTACA ACGAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 400 

20 CGGATACAAC CCTAAGAACG TTCCTTTCGT TCCTATCTCC GGTTTCAACG 450 

GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG 500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 600 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 650 

25 AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 700 

ACGGTTCCCG TCGGTCGTGT TGAGACTGGT ACCATCGTTC CTGGTATGGT 750 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 850 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 900 

30 GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 1000 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 1050 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGCTTGCA AGTTCGCTGA 1100 

GCTCCTCGAG AAGATTGACC GTCGTACCGG AAAGTCTGTT GAGGACCACC 1150 

35 CCAAGTTCAT CAAGTCCGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 1200 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 1250 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 1300 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GACTAACCAC 1350 

AAATCACGGG AATAGC 1366 



2) INFORMATION FOR SEQ ID NO: 904 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE : Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 
55 (B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 

CTATTATCGT TGTTGCCGCT TCCGACGGTC AAATGCCCCA GACTCGTGAG 50 
60 CATCTGCTGC TCGCCCGCCA GGTCGGTGTC CAGAAGATCG TTGTCTTCGT 100 

508 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CAACAAGGTC GATGCTGTTG AGGACAAGGA 
TGGAGATGCG CGAATTGCTC AGCAGCTACG 
CCCATCATCA TGGGATCTGC CCTCTGCGCC 
GATTGGTGTC AACCGAATTG ATGAGCTGCT 
5 TCCCCACCCC TCAGCGTGAG ACCGACAAGC 
GATGTCTTCT CCATTGCTGG TCGTGGCACT 
GCGAGGTATC CTGAAGCGCG ATGCTGAAGT 
CCGCCCCCAT CAAGACCAAG GTTACCGATA 
TGCGAGGAGT CTCGCGCTGG TGACAACTCC 

10 CAAGCGTGAT GAAGTCCGCC GTGGTATGGT 
TCAAGGCGCA CAAGAAGTTC CTTGTCTCCA 
GAAGGTGGTC GTCACACTGG CTTCGGTGAG 
CATCCGCACT GCCGACGAGT CGTGTGCCCT 
AGGACGCCCA TGACAAGCTT GTTATGCCCG 

15 TGCGAGCTCC ATGCACCACA CGTCTTGGAG 



PCT/CA00/01150 

GATGTTGGAG CTCGTCGAGA 150 

GCTTCGAGGG CGACGAGACT 200 

ATTGAGGGCC GCGAACCTGA 250 

CGAGGCCGTT GATACTTGGA 300 

CTTTCCTCAT GGCCGTCGAG 350 

GTCGTCTCTG GCCGTGTCGA 4 00 

TGAGCTCGTC GGCAAGGGCA 450 

TCGAGACCTT CAAGAAGTCC 500 

GGTCTCCTTC TTCGTGGTGT 550 

CGTTTCCGTC CCTGGACAGG 600 

TGTACGTGTT GAGCAAGGAG 650 

AACTACAGGC CGCAAATGTT 7 00 

GTACTGGCCA GAAGGCACCG 750 

GTGACAACGT CGAGATGGTT 800 

CCTGGTCAAC G 841 



2) INFORMATION FOR SEQ ID NO: 905 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 



35 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 2 00 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 250 

40 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 3 00 

. GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 350 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 4 00 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 450 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

45 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGCCAACAG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 750 

50 TACCGTCCCC AGCTGTTCAT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 800 

ATTCCGCTAC TAGGGAACCA TCTCTAATTC TATTTGCTAC AGATGAGGCT 850 

GCTGAGTTCA GCTTCCCCGA CGGAGACCAG TCCCGCCGTA TCATGCCCGG 900 

TGACAACGTC GAGATGATCG TCAAGACCCA CCGCCCCGTC GCCGCCGAGG 950 

CCGGTCAGCG CTTCAAC 967 

55 



2) INFORMATION FOR SEQ ID NO : 906 

60 (i) SEQUENCE CHARACTERISTICS: 

509 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaiiiensis 
10 (B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906 

TGGTGCTATT ATTGTTGTTG CCGCTTCCGA CGGTCAAATG CCCCAGACTC 50 

15 GTGAGCATCT GCTGCTCGCC CGTCAGGTCG GTGTTCAGAA GATCGTTGTC 100 

TTCGTTAACA AGGTCGACGC TGTCGAGGAC AAGGAGATGT TGGAGCTTGT 150 

CGAGATGGAG ATGCGCGAAC TGCTCAGCAG CTATGGCTTC GAGGGCGACG 2 00 

AGACCCCTAT CATCATGGGT TCTGCTCTCT GCGCCATTGA AGGCCGTCAA 250 

CCCGACATTG GTGTCGAACG AATTGACGAG CTGCTCGAGG CTGTTGATAC 3 00 

20 TTGGATTCCC ACCCCTCAGC GTGAGACCGA AAAGCCTTTC CTCATGGCCG 350 

TCGAGGATGT CTTCTCCATT GCTGGTCGTG GTACCGTCGT CTCTGGCCGT 400 
GTCGAACGAG GTATCCTGAA GCGCGATGCT GAAGTTGAGC TTGTGGGCAA " 450 

GGGCAGCGCA CCCATCAAGA CCAAGGTTAC CGATATCGAG ACCTTCAAGA 500 

AGTCTTGCGA GGAGTCCCGC GCTGGTGACA ACTCCGGTCT CCTTCTTCGT 550 

25 GGTGTTAAGC GTGATGAAGT CCGCCGTGGT ATGGTCGTTT CCGTCCCTGG 600 

ACAGGTTAAG GCGCACAAGA AGTTCCTTGT CTCCATGTAT GTGCTGAGCA 650 

AGGAGGAAGG TGGCCGACAC ACTGGCTTCG GTGAGAACTA CAGGCCGCAA 700 

ATGTTCATCC GCACTGCCGA CGAGTCGTGT GCCCTGTACT GGCCAGAAGG 750 

CACCGAGGAT GCCCACGACA AGCTTGTCAT GCCCGGTGAC AACGTCGAGA 800 

30 TGGTTTGCGA GCTCCATGCA CCACACGTCT TGGAGACTGG TCAGCGCTTC 850 

AA 852 



35 2 ) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 



GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 2 00 

55 TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 300 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 3 50 

AGGCCGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 4 00 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

60 TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

510 
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AGATCATCGG AGGTAGCTTC GATGCTACCA 
GAGACCTTCA AGAAGTCTTG TGACGAGTCC 
CTTGCTTCTC CGTGGTATCC GTCGTGAAGA 
TTGCTGCTCC TGGCAGCACC AAGGCCCACG 
5 TACGTTCT C A CTGAGGCTGA GGGTGGTCGT 
CTACCGCCCC CAGGTGTTCG TTCGCACTGC 
TCACTTAACG GTATTGATTA AGTTCTAACT 
GCTGACCTCA GCTTCCCCGA CGGTGATGAG 
TGACAACGTC GAGATGGTCC TCAAGACTCA 
10 CTGGCCAGCG CTTCAA 



PCT/CA00/01150 

AGACCAAGGT CACCGACATT" 550 

CGCGCTGGTG ACAACTCTGG 600 

CGTCCGCCGC GGAATGATCA 650 

ACCAGTTCTT GGTGTCCATG 700 

CGTACTGGCT TCGGCTCCAA 750 

TGGTAAGTCA AGCCTTTTGC 800 

GTTGTATCCT AGATGAGGCT 85 0 

TCCCGGAGGG TGATGCCTGG 90 0 

CCGCCCCATT GCTGCTGAGG 950 

966 



2) INFORMATION FOR SEQ ID NO: 908 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Altemaria alternate 

(B) STRAIN: ATCC 62 099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 908 

3 0 GGTGCTATCA TCGTCGTTGC TGCTTCCGAT GGTCAGATGC CCCAGACCCG 50 

TGAGCACTTG CTGCTCGCCC GTCAGGTCGG TGTTCAGAAG ATCGTTGTCT 10 0 

TCGTCAACAA GGTCGATGCT GTCGAAGACC CGGAGATGTT GGAACTCGTC 150 

GAGATGGAGA TGCGTGAGTT ACTCACCAGC TACGGCTTCG AGGGCGACGA 2 00 

GACACCCATC ATCATGGGTT CCGCTCTATG CGCCATCGAG GGCCGCCAGC 2 50 

35 CCGAGATCGG TGTTACCAAG GTCGACGAGC TAATGGACGC TGTCGACTCA 3 00 

TGGATCCCCA CCCCTCAGCG TGAGACCGAG AAGCCTTTCC TCATGGCTGT 350 

TGAGGATGTC TTCTCGATTG CTGGACGTGG TACCGTCGTT TCGGGCCGTG 4 00 

TCGAGCGCGG TATCTTGAAG CGTGACGCTG AAGTCGAGCT TGTCGGCAAG 4 50 

GGCACCGCGC CAATCAAGAC CAAGGTCACT GATATTGAGA CCTTCAAGAA 500 

40 GTCGTGCGAG GAGTCGCGCG CGGGTGATAA CTCCGGTCTT CTCCTCCGTG 550 
GTGTCAAGCG TGATGACGTT CGCCGCGGTA TGGTTGTTTC CGTTCCCGGA . 600 

CAAGTCAAGG CTCACAAGAA GTTCCTTGTC TCCATGTACG TTCTAAGCAA 650 

AGAGGAGGGT GGTCGTCACA CCGGCTTCGG CGAGAACTAC AGGCCGCAAA 700 

TGTTCATCCG AACTGCCGAT GAATCCTGCG CACTTCACTT CCCAGAGGGT 750 

45 ACCGAGGATG CGCACGACAA GCTAGTTATG CCCGGTGACA ACGTCGAGAT 800 

GGTCTGCGAA CTCCACCAGC CCCACGTTCT AGAGACCGGT CAGCG 84 5 



50 2) INFORMATION FOR SEQ ID NO: 909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

511 
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(A) ORGANISM: Peniclllium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909 

5 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

10 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

15 GCGTGGTATT TTGCGCAAGG ATT CTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

2 0 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 8 00 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 8 50 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

25 



2) INFORMATION FOR SEQ ID NO: 910 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
40 (B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

45 AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 2 00 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

50 GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 3 50 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

55 TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

60 GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

512 
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CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAJtGAC 90Q- 
ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



2) INFORMATION FOR SEQ ID NO: 911 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 



2) INFORMATION FOR SEQ ID NO: 912 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 



2 ) INFORMATION FOR SEQ ID NO: 913 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913 
GACGGYSYCA TGCCKCAGAC 



2 ) INFORMATION FOR SEQ ID NO: 914 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

513 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914 

5 

GAARAGCTGC GGRCGRTAGT G 21 

10 2 ) INFORMATION FOR SEQ ID NO: 915 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915 

AAACGCCTGA GGRCGGTAGT T 21 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 916 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
3 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916 
GCCGAGCTGG CCGGCTTCAG 20 

2) INFORMATION FOR SEQ ID NO: 917 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 22 

55 

2) INFORMATION FOR SEQ ID NO: 918 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1391 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
10 (E) ACCESSION NUMBER: J01672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918 

AGAGAAGCCT GTCGGCACCG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 50 

15 AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 100 

GTCAGGCTAC TGCGTATGCA TTGCAGACCT TGTGGCAACA ATTTCTACAA 150 

AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 200 

TTGACTATCC GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 250 

GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 3 00 

20 ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 350 

TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 400 

GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 450 

CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 500 

GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 550 

25 GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 600 

CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 650 

CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 700 

GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 750 

GATGAGCCAG GCGATGCGTA AGCTGGCGGG TAACCTGAAG CAGTCCAACA 800 

30 CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 850 

GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 900 

TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 950 

TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 1000 

CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GTATCAACTT 1050 

35 CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 1100 

CAGGCGCGTG GTACAGCTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 1150 

AATGCGACTG CCTGGCTGAA AGATAACCCG GAAACCGCGA AAGAGATCGA 1200 

GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 1250 

TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAACTAACGA AGATTTTTAA 1300 

40 TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 1350 

AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 1391 



45 2) INFORMATION FOR SEQ ID NO: 919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 

GGICCIGART CITMIGGIAA RAC 23 



60 
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2 ) INFORMATION FOR SEQ ID NO: 920 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 0 
TCICCVATIT CICCITCIAI YTC 23 

2 ) INFORMATION FOR SEQ ID NO: 921 



(i) SEQUENCE CHARACTERISTICS: 
2 0. (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921 
TIYRTIGAYG CIGARCAIGC 20 

30 

2) INFORMATION FOR SEQ ID NO: 922 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
45 TARAAYTTIA RIGCIYKICC ICC 23 



2) INFORMATION FOR SEQ ID NO: 923 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 

60 
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GACGCIGCCA TCCTGATGAT C 
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21 



5 2) INFORMATION FOR SEQ ID NO: 924 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 21 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 925 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925 
AAGCAGATGG TTGTGTGCTG 2 0 

2) INFORMATION FOR SEQ ID NO: 926 



(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 6 
CAGCTGCTCG TGGTGCATCT CGAT 24 

50. 

2) INFORMATION FOR SEQ ID NO: 927 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927 
5 ACGCGGAGAA GGTGCGCTT 19 



10 



2) INFORMATION FOR SEQ ID NO: 928 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928 

20 

GGTCGTTCTT CGAGTCACCG CA 22 



25 2 ) INFORMATION FOR SEQ ID NO : 929 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: ATCC 25285 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929 

40 

TTCAGCATGC CATTTCAAAA CAGGCCGAAG CCGATATCGT GATTATCGCT 50 

GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG GAAATCTTTA CCGAATTTCC 100 

GGAATTGGTG GACCCGCACA CGGGACGTAA GCTGATGGAG CGTACCATTA 150 

TTATCGCAAA TACATCGAAC ATGCCGGTAG CAGCGCGTGA AGCTTCTGTG 200 

45 TATACGGCCA TGACGATTGC CGAATACTAT CGTGCCATGG GATTGAAAGT 250 

CCTGCTGATG GCAGACTCCA CTTCCCGTTG GGCGCAGGCA TTGCGTGAGA 300 

TGTCGAACCG TATGGAGGAG TTGCCCGGAC CGGATGCATT CCCGATGGAC 350 

CTGTCCTCAA TCATTTCTAA CTTCTATGGC CGTGCAGGCT ACGTGAAACT 400 

GAATAACGGC GAGAGCGGTT CTATTACCTT TATCGGTACA GTATCACC 448 

50 



2) INFORMATION FOR SEQ ID NO: 930 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 
5 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930 

GCTATCTCTA AACAAGCGGA AGCGGATATC GTGATTATCG CCGCCTGCGG 50 

10 TGAGCGTGCG AATGAGGTCG TAGAGGTATT TACGGAGTTC CCGGAATTGG 100 

TAGACCCGCA TACGGGACGT AAATTGATGG AACGTACGAT CATTATCGCC 150 

AATACATCCA ACATGCCGGT AGCCGCTCGT GAGGCATCCG TATATACGGC 2 00 

GATGACCATC GCCGAGTATT ATCGCAGCAT GGGTTTGAAG GTTCTGTTGA 250 

TGGCCGACTC TACTTCCCGC TGGGCACAGG CTTTGCGTGA GATGTCCAAC 3 00 

15 CGTTTGGAGG AGTTGCCGGG ACCGGATGCT TTCCCGATGG ACTTGTCCGC 3 50 

TATCGTGGCG AACTTCTACG CTCGTGCGGG ATTCGTTCAT TTGAATAACA 400 

ACGCTACAGG CTCCGTCACT TTCATCGGTA CGGTATCG 438 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 931 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 453 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytxca 

(B) STRAIN: ATCC 25260 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931 

CCTCCAGCAC GCTATCTCTA AGCAGGCGGA GGCTGATATC GTCATTATGG 50 

CAGCCTGCGG TGAGCGTGCT AATGAGGTGG TGGAGATCTT TGCCGAGTTC 100 

CCTGAGCTCG AAGACCCACA CACGGGACGC AAGCTGATGG AGCGTACGAT 150 

40 CATCATCGCT AACACGAGTA ACATGCCAGT GGCTGCTCGT GAGGCTTCGG 200 

TCTACACCGC TATGACCATC GCTGAGTACT ACCGCTCGAT GGGTCTCAAA 250 

GTACTCCTAA TGGCTGACTC GACCTCTCGC TGGGCACAGG CACTGCGTGA 300 

GATGTCTAAC CGTCTAGAGG AGCTGCCTGG ACCAGATGCA TTCCCGATGG 350 

ACTTGTCGGC TATCGTGGCA AACTTCTACG CTCGTGCCGG CTTCGTCTAT 400 

45 CTCAACAACG GTGAGACAGG TTCTGTAACC TTCATCGGTA CGGTCTCTCC 450 

AGC 453 



50 2) INFORMATION FOR SEQ ID NO: 932 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCATGAACA AATGTGACAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA ATTCGTGATC TATTAACTGA ATATGAATTC CCTGGCGATG 200 

ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA AGGTGAAGCT 250 

GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 300 

TCCAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA 4 00 

CGTGGACAAG TTAAAGTTGG TGACGAAGTA GAAGTTATCG GTATCGAAGA 450 

AGAAAGCAAA AAAGTAGTAG TAACTGGAGT AGAAATGTTC CGTAAATTAC 500 

TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT ACGTGGTGTT 550 

GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 600 

TACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTATTGTT ACACTTCCAG AAGGTACTGA 750 

AATGGTAAYG CCTGGTGATA ACATTGAGCT TGCAGTTGAA CTAATTGCAC 800 

CAATCGCTAT CGAAGACGGT ACTAAATTCT CTATC 835 



2) INFORMATION FOR SEQ ID NO : 933 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 



2) INFORMATION FOR SEQ ID NO: 934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG IAGA 



2 ) INFORMATION FOR SEQ ID NO: 935 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935 
GGIAARWSIC ARYTITGYCA YAC 



2) INFORMATION FOR SEQ ID NO : 936 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 936 
TCISIYTCIG GIARRCAIGG 



2) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937 
AT I AC I GARG YITTYGGIGA RTT 



2) INFORMATION FOR SEQ ID NO: 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938 
CYIGTIGYIS WIGCRTGIGC 
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2 ) INFORMATION FOR SEQ ID NO: 93 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevisiae 
(C) ACCESSION NUMBER: D10023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 9 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTACCAGC AGACCTTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

20 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGG2^AGCGC AAGGTGAAAT 200 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 250 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAACTAAGG 300 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 350 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 400 

25 ACGAAGCGGC AAGGCTAGTG CCTATGGGAT TTGTCACGGC TGCTGATTTT 450 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 500 

GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT ACTGAGCTTT 550 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

ACATGCCAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 650 

3 0 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 700 

AGCGGTTCGG ATTAGACCCG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 800 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTCGAT TCTGTTATGG 850 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

35 ATGCATTTAG CCAAATTTAT GCGTGCTTTG CAAAGGCTGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 1050 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 110 0 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 1150 

40 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 1200 

TAG 1203 



45 2) INFORMATION FOR SEQ ID NO: 940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 
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CGATCCAATT GCTGGTCTTA AGATGCATTT GATTGATCTA GGTATTGCCJ^ 50 
CTGAAGCTGA AGTCAAAGCT TACGACAAGT CCGCTAGAAA ATACGTTGAC 100 
GAACAAGTTG AATTAGCTGA TGCTGCTCCT CCTCCAGAAG CCAAATTATC 150 

5 CATCTTGTTT GAAGACGTCT ACGTGAAAGG TACAGAAACT CCAACCCTAA 2 00 

GAGGTAGGAT CCCTGAAGAT ACTTGGGACT TCAAAAAGCA AGGTTTTGCC 2 50 

TCTAGGGATT AATTAAATCG TAAGGAAAAA TAAAATAATA GTGCTGTGAT 3 00 

CGCATGATAT TCTTCCCTGG AAGCGCCATT TTATAGCAAG AAATGTAAGT 35 0 

CAAGTATATT TTAACTGTAT ATACAACAAT ATGACTCTTT TTTATGCCTT 400 

10 GTTGTTTTTC TTCGGGTTTT CCCACACATT GTGTGGAGAG ATAGTTATTA 450 

ACAGACCGAA AATAGCCGCC CAAGGATAAA CTTTTATATA AAGGGAAGGG 500 

TAGTTGACCC AAAAATTTGG ATTCTACTTT CCAGATTTAC TTTCACCCTT 550 

TTATATTTGC TGTAGTCTGT TATGCCAATC AGGAAAGCAT TTGAACAAAT 600 

ATGTCTGTTA CAGGAACTGA GATCGATAGT GATACAGCAA AAAATATTCT 650 

15 TAGTGTAGAT GAATTACAGA ACTATGGTAT TAATGCCTCA GATCTTCAAA 700 

AATTGAAGTC TGGTGGGATA TACACAGTCA ATGTATGTTA TAATAACATT 750 

TTTAAAACCT CTGCTGTAGA GGTTCTTTCC CCCTTTCTTT TACTAACTAA 800 

TAATTTGGAA AGGAACTTTT ATAGACCGTT TTGTCAACAA CAAGAAGACA 850 

TCTATGTAAA ATTAAAGGGT TAAGTGAGGT GAAAGTGGAA AAAATTAAAG 900 

2 0 AAGCTGCTGG AAAAATCATA CAGGTGGGAT TTATCCCCGC TACTGTACAA 950 

TTAGATATAA GACAGCGTGT GTACTCCTTA TCAACTGGAT CTAAGCAACT 1000 

AGATTCAATC CTAGGTGGTG GAATAATGAC AATGAGTATC ACTGAAGTAT 1050 

TTGGTGAATT TAGGTGTGGT AAGACACAGA TGTCTCATAC TTTGTGTGTT 1100 

ACCACGCAGC TTCCGAGGGA AATGGGTGGT GGTGAAGGGA AAGTAGCATA 1150 

25 TATTGATACA GAAGGCACTT TCAGGCCCGA GAGGATTAAG CAAATTGCAG 1200 

AAGGTTATGA ATTGGATCCC GAGTCATGTT TGGCAAACGT TTCATATGCT 1250 

AGAGCCTTGA ATAGTGAACA TCAAATGGAA CTTGTTGAAC AATTGGGTGA 1300 

AGAACTTAGT TCTGGAGATT ATCGCCTTAT CGTGGTAGAT TCTATAATGG 1350 

CAAACTTCAG AGTAGACTAC TGCGGTAGAG GTGAACTAAG CGAAAGACAG 1400 

3 0 CAAAAGCTAA ATCAACATCT TTTCAAATTG AATAGATTGG CAGAGGAATT 1450 

TAATGTTGCA GTATTTCTGA CAAACCAAGT TCAATCAGAC CCAGGTGCTT 1500 

CTGCATTATT TGCCTCGGCA GATGGTAGGA AACCAATTGG AGGGCACGTT 1550 

CTGGCACATG CGTCAGCAAC AAGGATTTTG TTGAGAAAAG GGCGTGGTGA 1600 

CGAAAGAGTT GCCAAGTTAC AAGATTCCCC AGATATGCCT GAAAAAGAAT 1650 

35 GTGTCTACGT AATTGGTGAA AAAGGTATTA CCGATTCAAG TGACTAGTTT 17 00 

TTGTATACTT TTTTAATGAA GATGACATTG CTCCTTTATT AAACTTTTCT 1750 

TTTACTTTGT GTTACTAATA TTATTAATAT CTTGTTATGA TTCTTTGTTT 18 00 
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2 ) INFORMATION FOR SEQ ID NO: 941 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus hnmicolus 

(B) STRAIN: ATCC 382 94 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 5 0 

TCTCCGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 100 

TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 150 

2 0 GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 200 

GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 250 

GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 3 00 

CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 350 

ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 400 

25 ACCACCAAGA AGGGTTCGAT TACCTCCGTC 43 0 



2 ) INFORMATION FOR SEQ ID NO: 942 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 

45 CGTGTACGAT GCTCTTGAGG TGCAAAATGG TAATGAGCGT CTGGTGCTGG 50 

AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TGCGTACCAT CGCAATGGGT 100 

TCCTCCGACG GTCTGCGTCG CGGTCTGGAT GTAAAAGACC TCGAACACCC 150 

GATCGAAGTC CCGGTAGGTA AAGCGACTCT GGGCCGTATC ATGAACGTAC 2 00 

TGGGTGAACC GGTCGACATG AAAGGCGAGA TCGGTGAAGA AGAGCGTTGG 250 

50 GCGATTCACC GCGCAGCACC TTCCTACGAA GAGCTGTCAA ACTCTCAGGA 300 

ACTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT CCGTTCGCTA 3 50 

AGGGCGGTAA AGTTGGTCTG TTCGGTGGTG CGGGTGTAGG TAAAACCGTA 400 

AACATGATGG AGCTCATTCG TAACATCGCG ATCGAGCACT CCGGTTACTC 450 

TGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGGAAC GACTTCTACC 50 0 

55 ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT GGTGTATGGC 550 

CAGATGAACG AGCCGCCGGG AAACCGTCTG CGCGTAGCTC TGACCGGTCT 600 

GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTT CTGCTGTTCG 650 

TTGACAACAT CTATCGTTAC ACCCTGGCCG GTACGGAAGT ATCCGCACTG 700 

CTGGGCCGTA TGCCTTCAGC GGTAGGTTAT CAGCCGACCC TGGCGGAAGA 750 

60 GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAAACC GGTT 794 
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2) INFORMATION FOR SEQ ID NO: 943 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 

20 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTACGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATTGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 20 0 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

25 AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

30 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

3 5 CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCAC 814 



40 2) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGTACCATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 150 

ATCGAAGTCC CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 200 

60 GGGTGAACCG GTCGACATGA AAGGCGAGAT CGGTGAAGAA GAGCGTTGGG 250 
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CGATTCACCG 
CTGCTGGAAA 
GGGCGGTAAA 
ACATGATGGA 
GTGTTTGCGG 
CGAAATGACC 
AGATGAACGA 
ACCATGGCTG 
TGACAACATC 
TGGGCCGTAT 
ATGGGCGTTC 



CGCAGCACCT 
CCGGTATCAA 
GTTGGTCTGT 
GCTTATTCGT 
GCGTAGGTGA 
GACTCCAACG 
GCCGCCGGGA 
AGAAATTCCG 
TATCGTTACA 
GCCTTCAGCG 
TGCAGGAACG 



TCCTACGAAG 
AGTTATCGAC 
TCGGTGGTGC 
AACATCGCGA 
ACGTACTCGT 
TTATCGACAA 
AACCGTCTGC 
TGACGAAGGT 
CCCTGGCCGG 
GTAGGTTATC 
TATCACCTCC 



AGCTGTCAAA 
CTGATGTGTC 
GGGTGTAGGT 
TCGAGCACTC 
GAGGGTAACG 
AGTATCCCTG 
GCGTTGCTCT 
CGTGACGTTC 
TACGGAAGTA 
AGCCGACCCT 
ACCAAAACCG 



CTCTCAGGAA 3 0/0 

CGTTCGCTAA 350 

AAAACCGTAA 4 00 

CGGTTACTCT 450 

ACTTCTACCA 5 00 

GTGTATGGCC 550 

GACCGGTCTG 600 

TGCTGTTCGT 650 

TCCGCACTGC 7 00 

GGCGGAAGAG 750 

GTTCTATC 798 



2) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

ACCGGTTCTA TC q 19 



2) INFORMATION FOR SEQ ID NO: 946 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL, SOURCE: 

(A) ORGANISM: Neisseria polysaccharea ■ 

(B) STRAIN: ATCC 43768 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 6 



GCGACGCTAT CCCGCATGTT TACGATGCCC TGAAATTGGA CGAGAACGGT 50 

CTGACTCTGG AAGTTCAACA ACTTCTGGGT GACGGCGTTG TCCGTACTAT 100 

TGCAATGGGT AGTTCAGACG GCCTGAAACG CGGCATGTCT GTAAGCAATA 150 

10 CTGGTGCGCC AATCACTGTG CCGGTAGGTA AAGGTACTTT GGGTCGTATT 2 00 

GTCGACGTAT TGGGTACGCC TGTTGATGAA GCAGGTCCGA TCGATACCGA 2 50 

CAAGAGCCGT GCCATTCACC AAACTGCTCC GAAATTCGAC GAGTTGTCTG 300 

CAACTACCGA ATTGTTGGAA ACCGGTATTA AAGTGATCGA CTTGCTGTGT 350 

CCGTTTGCTA AAGGCGGTAA AGTAGGTCTG TTCGGTGGTG CCGGTGTAGG 4 00 

15 CAAAACCGTG AACATGATGG AATTGAT CAA CAACATCGCC AAAGCGCACA 4 50 

GCGGTCTGTC CGTGTTCGCA GGTGTGGGCG AGCGTACCCG TGAAGGTAAC 500 

GACTTCTACC ACGAGATGAA AGATTCCAAC GTATTGGATA AAGTGGCAAT 550 

GGTTTACGGT CAGATGAACG AACCTCCGGG CAACCGTTTG CGCGTCGCAT 600 

TGACCGGTTT GACCATGGCG GAATACTTCC GTGACGAAAA AGACGAAAAC 650 

2 0 GGTAAAGGTC GCGACGTATT GTTCTTCGTT GACAACATCT ACCGTTACAC 700 

TCTGGCCGGT ACCGAAGTAT CTGCACTGTT GGGCCGTATG CCTTCTGCAG 7 50 

TGGGTTACCA ACC GACATTG GCTGAAGAAA TGGGTCGTTT GCAAGAGCGT 8 00 

ATTACCTCTA CCCAAACCGG TTCCATTACT TC 832 

25 



2) INFORMATION FOR SEQ ID NO: 947 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947 



TCCGCGCGAT GCCATTCCGC ATGTTTACGA CGCCCTGAAA TTGGATGCAA 50 

ACGGCCTGAC TTTGGAAGTA CAACAGCTTC TGGGCGACGG CGTGGTTCGT 100 

45 ACTATTGCAA TGGGTAGTTC GGACGGTCTG AAACGCGGCA TGACTGTAAG 150 

CAATACAGAT GCGCCGATTA CTGTGCCGGT AGGTAAAGGT ACTTTGGGAC 2 00 

GTATTGTCGA TGTGTTGGGT ACACCTGTTG ATGAAGCAGG TCCGATTGAT 250 

ACCGACAAAC ACCGTGCTAT CCATCAGACA GCTCCGAAAT TCGATGAGTT 300 

GTCTGCTACT ACCGAGCTGC TGGAAACAGG CATTAAAGTG ATTGACTTGC 3 50 

50 TGTGTCCGTT TGCCAAAGGC GGTAAAGTAG GTCTGTTCGG TGGTGCCGGT 4 00 

GTAGGCAAAA CCGTCAACAT GATGGAATTG ATTAACAACA TCGCCAAAGC 4 50 

GCATAGTGGT TTGTCCGTGT TCGCCGGTGT GGGGGAACGT ACCCGTGAAG 500 

GTAACGACTT CTACCACGAG ATGAAAGATT CCAACGTATT GGACAAAGTG 550 

GCGATGGTTT ACGGTCAGAT GAACGAACCT CCGGGTAACC GTCTGCGTGT 600 

55 AGCCTTGACC GGTTTGACGA TGGCCGAATA CTTCCGTGAT GAAAAAGACG 650 

AAAGCGGCAA AGGTCGCGAC GTATTGTTCT TCGTGGACAA CATTTACCGT 7 00 

TACACTCTGG CCGGTACAGA AGTATCCGCA TTGCTCGGTC GTATGCCTTC 750 

AGCAGTAGGT TACCAACCGA CATTGGCTGA AGAAATGGGT CGTCTGCAAG 800 

AGCGTATTAC CCTCTACTCA AACAGGCTCC ATTACTTCTA 840 



60 
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2) INFORMATION FOR SEQ ID NO: 948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
15 (B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948 

GCAGCTGGCG AGAAGCTACC TGAGATCAAT AATGCACTTG TAGTCTATAA 50 

20 AAATGACGAA AAAAAATCAA AAATCGTCCT TGAAGTAGCT CTTGAGCTTG 100 

GTGATGGAGT GGTTCGGACC ATCGCTATGG AATCAACGGA TGGGTTGACT 150 

CGTGGCATGG AAGTGCTAGA TACTGGCCGT CCAATTTCTG TGCCAGTCGG 2 00 

CAAAGAAACA CTTGGTCGCG TCTTTAACGT TTTGGGAGAT ACCATTGACT 2 50 

TGGATGCTCC TTTTGCGGAT GATGCAGAGC GCCAGCCAAT CCATAAGAAA 3 00 

25 GCTCCAACCT TTGATGAGTT GTCTACTTCT TCAGAGATCT TAGAGACAGG 3 50 

TATCAAGGTT ATCGACCTGT TAGCCCCTTA TCTGAAAGGT GGTAAAGTTG 4 00 

GACTCTTCGG TGGTGCCGGA GTTGGTAAGA CCGTCCTGAT TCAAGAATTG 450 

ATCCACAACA TTGCCCAAGA ACACGGTGGT ATTT CTGTAT TTACTGGCGT 500 

TGGGGAACGT ACCCGTGAAG GGAATGACCT TTATTGGGAA ATGAAAGAGT 550 

30 CTGGTGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAGAT GAATGAGCCG 600 

CCAGGAGCGC GTATGCGGGT TGCTTTGACT GGTTTGACGA TTGCAGAATA 650 

CTTCCGTGAT GTGGAAGGTC AAGATGTCTT GCTCTTCATT GACAACATCT 700 

TCCGTTTCAC GCAGGCAGGT TCTGAAGTTT CTGCCCTTTT GGGTCGGATG 750 

CCGTCAGCCG TTGGTTACCA ACCAACACTT GCGACAGAAA TGGGGCAATT 800 

35 GCAAGAGCGT ATCACATCGA CTAAGAAGGG TTCTGTAACC TCT 843 



40 



2) INFORMATION FOR SEQ ID NO: 94 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCCAACTT TTGATGAATT GTCTACCTCT 
GATTAAGGTT ATCGACCTTC TTGCCCCTTA 
GACTTTTCGG TGGTGCCGGA GTTGGTAAAA 
ATTCACAACA TTGCCCAAGA ACACGGTGGT 
5 TGGGGAACGT ACTCGTGAGG GTAATGACCT 
CAGGTGTTAT CGAGAAAACA GCCATGGTAT 
CCAGGAGCAC GTATGCGTGT TGCCCTAACT 
CTTCCGTGAT GTGGAAGGCC AAGACGTGCT 
TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT 
10 CCATCAGCCG TTGGTTACCA ACCAACACTT 
GCAAGAGCGT ATTAC AT CAA CTAAAAAGGG 



PCT/CA00/01150 

TCTGAAATCC TTGAAACAGG 350 

CCTTAAAGGT GGTAAGGTTG 4 00 

CCGTCTTAAT CCAAGAATTG 450 

ATTTCAGTAT TTACCGGTGT 500 

TTACTGGGAA ATGAAAGAAT 550 

TTGGTCAGAT GAATGAGCCG 60 0 

GGTTTGACAA TCGCCGAATA 650 

TCTCTTTATC GATAATATCT 700 

CTGCCCTTTT GGGTCGTATG 750 

GCTACGGAAA TGGGTCAATT 800 

TTCTGTAACC T 841 



15 2) INFORMATION FOR SEQ ID NO: 950 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950 



GCTACCTGAG ATCAATAATG CACTTGTAGT CTATAAAAAT GACGAAAATA 50 

AATCAAAAAT CGTCCTTGAA GTAGCTCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 200 

35 GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 3 00 

TGAGTTGTCT ACTTCATCAG AGATCTTAGA GACAGGTATC AAGGTTATCG 350 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 400 

GCCGGAGTTG GTAAGACCGT CCTGATTCAG GAATTGATCC ACAACATTGC 450 

40 CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG GAACGTACCC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCCACCAG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

45 GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 750 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATCGACTAA GAAAGGTTCT GTAACTT 827 



50 

2 ) INFORMATION FOR SEQ ID NO: 951 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951 

GCAGCAGGGG AAACACTTCC TGAGATTAAT AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GTGATGGTAT GGTCCGTACG ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

10 CGTGGAATGG AAGTTTTGGA CACAGGCCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACTATTGACT 250 

TGGATGCTCC TTTCGCTGAA GACGCTGAGC GTCAGCCAAT TCATAAGAAA 30 0 

GCTCCAACTT TTGATGAATT GTCTACCTCA TCTGAAATCT TGGAAACAGG 350 

GATTAAGGTT ATCGACCTTC TTGCCCCTTA CCTTAAAGGT GGGAAAGTTG 4 00 

15 GACTCTTCGG TGGTGCCGGA GTTGGTAAAA CTGTCTTGAT CCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGAGAACGT ACCCGTGAGG GGAACGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTAT TTGGTCAGAT GAATGAGCCA 600 

CCTGGAGCAC GTATGCGTGT TGCTCTTACT GGTTTGACAA TCGCCGAATA 650 

20 CTTCCGTGAT GTAGAAGGCC AAGATGTGCT TCTCTTTATC GACAATATCT 700 

TCCGTTTCAC TCAAGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGGATG 750 

CCTTCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAGAAGGG TTCTGTAACC TCTA 844 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 952 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 



GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

45 GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT 250 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 

GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 350 

50 GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 400 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT 500 

TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

55 CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 700 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CCAAGAAGGG 83 0 
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2 ) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

2 0 AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 200 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

25 TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 4 00 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

3 0 CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

35 ATCACATCAA CTAAGAAGGG TTC 823 



40 



2) INFORMATION FOR SEQ ID NO: 954 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCCAACTT 
GATCAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGCGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



TTGATGAGTT 
ATTGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATCACATCAA 



GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CCAAGAAGGG 



TCTGAAATCC 
CCTTAAAGGT 
CTGTCTTAAT 
ATTTCAGTAT 
TTACTGGGAA 
TTGGTCAGAT 
GGTTTGACAA 
TCTCTTTATC 
CTGCCCTTTT 
GCTACGGAAA 
TTCTGTAACC 
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TTGAAACAGG 350 

GGTAAAGTTG 4 0 0 

CCAAGAATTG 450 

TTGCTGGTGT 50 0 

ATGAAAGAAT 550 

GAATGAGCCA 60 0 

TCGCTGAATA 65 0 

GATAATATCT 7 00 

GGGTCGTATG 750 

TGGGTCAATT 8 00 

TCTA 844 



15 2) INFORMATION FOR SEQ ID NO: 955 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 



AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

3 5 TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 2 50 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 3 00 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 3 50 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 45 0 

4 0 TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 60 0 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

45 TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 8 00 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



50 

2) INFORMATION FOR SEQ ID NO: 956 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 495 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956 



TTGTATATCA CAGGCACTCA GCAAATATTC 
ACGTGGGTTG TGGTGAACGT GGGAATGAAA 
TTCCCTGAAC TATCTACTGT AGTTAATGAT 

10 ACGTACATGC TTAGTTGCCA ATACTTCTAA 
AAGCTAGTAT ATACACTGGT ATTACAATTG 
GGTTACAACT GCACTCTTAT GGCCGATTCC 
TCTAAGGGAA ATTTCTGGTA GATTGGCTGA 
ATCCGGCCTA TTTATCGTCA AGGTTGTCAG 

15 GGGATAACTG TCTAATTAAT TTAGGCTTGA 
ACACGAACCG GATCTATTAC GGTTGTAGGA 



CGATACTGAC GTAATTATAT 50 

TGGCTGAGAT TCTATGCGAA 100 

GAAAAGGTGG CCATTATGGA 150 

TATGCCAGTG GCCGCTAGAG 2 00 

CTGAATATTT CCGTGATATG 2 50 

ACTAGCCGAT GGGCAGAGGC 300 

AATGCCTGCA GATTCTGGCT 3 50 

CTTTTTATGA ACGTGCAGGT 4 00 

TTAAGTGCTT AGGTTCACCA 4 50 

GCAGTTTCTC CACCA 4 95 



20 2) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-1MSS 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957 



AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAAC GA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTCTATTAA AGTAGGAGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 2 00 

4 0 AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 2 50 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 3 00 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 3 50 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 4 00 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

45 AGGAGCTGTT TCACCACCT 4 69 



2) INFORMATION FOR SEQ ID NO: 958 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Fusobacterium nucleatum subsp. polymorphism 

533 
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(B) STRAIN: ATCC 10953 
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(xi) SEQUENCE DESCR! 

5 TACAACACCA ACTTGCTAAA 
GGTTGTGGGG AACGTGGAAA 
AGAAATTATT GACCCTAAGA 
TTATAGCTAA TACTTCTAAT 
TATACTGGTA TAACTATTGC 
10 GGCACTTATG GCAGATTCAA 
TGTCAGGACG TTTGGAAGAA 
CTATCAAGTA GAATAGCAGA 
TCTAGGTAAT GGAGAAGAAG 
CA 

15 



PTION: SEQ ID NO: 958 

TGGGCAGATG CAGAAGTAGT 
TGAAATGACC GATGTACTTA 
CAGGACAATC TTTAATGAAG 
ATGCCAGTTG CTGCTCGTGA 
AGAATATTTT AGAGATATGG 
CAAGTCGTTG GGCAGAAGCA 
ATGCCAGGTG ATGAAGGATA 
GTTTTATGAA AGAGCAGGGC 
GAGCATTAAC TGTAATTGGA 



TGTTTATGTT 50 

TGGAATTCCC * 100 

AGAACAGTTC 150 

GGCTTCAATC 2 00 

GATATTCAGT 25 0 

CTTCGTGAAA 300 

TCCAGCATAT 350 

TTGTTGAATG 4 00 

GCAGTATCTC 450 

452 



2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

3 5 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

40 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 400 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG 469 



2) INFORMATION FOR SEQ ID NO: 960 



(i) SEQUENCE CHARACTERISTICS: 
50- (A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS; Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial tropica 

(B) STRAIN: ATCC 30815 
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(xi) SEQUENCE DESCR! 

TGTAATTAGT CAGGCCCTCT 
ATGTCGGCTG CGGCGAGCGC 
5 TTCCCGACCC TGACGACCGT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT TTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
10 ACCCTGCCTA TCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCGCCG 



PTION: SEQ ID NO: 960 

CCAAGTACTC CAACTCCGAC 
GGTAATGAGA TGGCCGAGGT 
GATCGATGGT CGCGAGGAGT 
ACACTTCGAA CATGCCAGTC 
ATCACCCTGG CCGAGTACTA 
GGCCGACTCG ACGTCTCGCT 
GTTTGGCGGA GATGCCGGCC 
CGTCTCGCCT CCTTCTACGA 
GCCGAAGCGC CAGGGCTCCG 



TGCGTCATCT 50 

GCTCATGGAG 100 

CGATCATGAA 150 

GCAGCCCGTG 2 00 

CCGTGATATG 250 

GGGCCGAGGC 300 

GATGGTGGCT 350 

GCGCGCCGGC 400 

TCACGATTGT 450 

469 
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2) INFORMATION FOR SEQ ID NO: 961 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 



TGTCATCAGT 
ATGTCGGCTG 
TTCCCGACCC 

35 GCGCACCTGC 
AGGCCTCTAT 
GGCAAGCACA 
GCTGCGTGAG 
ACCCTGCCTA 

40 CTCGTCACCT 
CGGTGCAGTG 



CAGGCCCTCT 
CGGTGAACGC 
TGACGACTGT 
CTCGTGGCGA 
TTATACCGGC 
TTGCCATGAT 
ATTTCGGGTC 
CCTCAGCGCC 
GCATCGGCGG 
TCTCCACCG 



CCAAGTACTC 
GGTAACGAGA 
GATCGATGGT 
ACACCTCGAA 
ATCACCCTTG 
GGCCGACTCG 
GATTGGCGGA 
CGCCTCGCCT 
GCCGAAGCGC 



CAACTCGGAC 
TGGCCGAGGT 
CGCGAAGAGT 
CATGCCCGTC 
CTGAGTACTA 
ACGTCTCGCT 
GATGCCGGCT 
CCTTCTACGA 
CAGGGCTCCG 



TGTGTCATCT 50 

GCTCATGGAG 100 

CCATCATGAA 150 

GCAGCCCGTG 200 

CCGTGATATG 250 

GGGCCGAGGC 300 

GATGGTGGCT 350 

GCGCGCCGGT 400 

TCACGATCGT 450 

469 



45 2) INFORMATION FOR SEQ ID NO: 962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Leisnmania donovani 

(B) STRAIN: ATCC 50212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 

535 
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TGTCATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 
TTCCCGACCC TGACGACCGT GATCGATGGC 
GCGCACCTGC CTCGTGGCGA ACACCTCGAA 
5 AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCTGACTCG 
GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA 
ACCCCGCCTA CCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
10 CGGTGCCGTG TCTCCACCG 



PCT/CA00/01150 

CAACTCCGAT TGCGTCATCT 50 

TGGCCGAGGT GCTCATGGAG 100 

CGCGAGGAGT CGATCATGAA 150 

CATGCCAGTC GCAGCCCGTG 200 

CCGAGTACTA CCGTGATATG 250 

ACGTCTCGCT GGGCCGAGGC 300 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATCGT 450 

469 



2) INFORMATION FOR SEQ ID NO: 963 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20. (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

30 TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGACGACTGT GATCGATGGT CGCGAGGAGT CCATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 200 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 250 

3 5 GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 350 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 4 00 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 4 69 

40 



2) INFORMATION FOR SEQ ID NO: 964 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
55 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
60 ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

536 
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TTCCCGACCC TGACGACCAT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT CTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
ACCCCGCCTA CCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCGCCG 



GATCGATGGT 
ACACCTCGAA 
ATCACCCTCG 
GGCCGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CGGGAGGAGT CGATCATGAA 
CATGCCCGTC GGAGCCCGTG 
CCGAGTACTA CCGTGATATG 
ACGTCTCGCT GGGCTGAGGC 
GATGCCGGCC GATGGTGGCT 
CCTTCTACGA GCGCGCCGGC 
CAGGGCTCCG TCACGATCGT 



150 
200 
250 
300 
350 
400 
450 
469 



2) INFORMATION FOR SEQ ID NO: 965 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 200 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 400 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG 469 



2) INFORMATION FOR SEQ ID NO: 966 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 2 00 



537 
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AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
TAATGGCCGA CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
5 GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



CATGGGATAT TCAGTAGCGG 250 

AGGCCCTAAG AGAGATGTCA 3 00 

GGTTATCCAG CCTATCTAGG 350 

AGGTAAGGTA ATATGTAAGG 400 

TAATAGGTGC CGTGTCACC 44 9 



2) INFORMATION FOR SEQ ID NO: 967 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 97 97 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

25 CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 5 0 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 2 00 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 250 

30 GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGCTGGACAC GTACATTCCG 3 00 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 350 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 400 

GCGTGGTGAA GGTTGGCGAG GAAATCGAAA TCGTGGGCAT CAAGCCGACG 450 

GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA 500 

3 5 GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

4 0 CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 800 

CATGGAAGAA GGTCTGCGTT TCGCCA 826 



45 2) INFORMATION FOR SEQ ID NO: 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 817 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella. pertussis 

(B) STRAIN: BD180 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968 

538 
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CGATCCTGGT GGTGTCGGCC GCAGACGGCC 
CACATTTTGC TGTCGCGCCA GGTTGGCGTG 
GAACAAGGCG GACATGGTTG ATGACGCGGA 
TGGAAGTCCG CGAACTGCTG AGCAAGTACG 
5 CCGATCGTGA AGGGTTCGGC CAAGCTGGCG 
ACTGGGCGAG CAGGCGATTC TGTCGCTGGC 
TTCCGACGCC GGAGCGCGCG GTCGACGGTG 
GACGTGTTCT CGATCTCGGG CCGTGGCACG 
GCGCGGCGTG GTGAAGGTTG GCGAGGAAAT 

10 CGACGGTGAA GACGACCTGC ACGGGCGTGG 
GACCAGGGCC AGGCGGGCGA CAACGTGGGT 
GCGTGAAGAC GTCGAGCGTG GCCAGGTGCT 
ACCCGCACAC GGACTTCACG GCCGAGGTGT 
GGTGGCCGTC ACACGCCGTT CTTCAACGGC 

15 CCGCACGACG GACGTGACCG GCACGATCGA 
TGGTGCTGCC GGGCGACAAC GTGTCGATGA 
ATCGCCATGG AAGAAGG 



PCT/CA00/01150 

CGATGCCGCA GACGCGCGAG 50 

CCGTACATCA TCGTGTTCCT 100 

GCTGCTCGAG CTGGTGGAGA 150 

ATTTCCCGGG CGATGACACG 200 

CTGGAAGGCG ACAAGGGCGA 250 

GCAAGCGCTG GACACGTACA 3 00 

CGTTCCTGAT GCCGGTGGAA 350 

GTGGTGACTG GCCGTATCGA 4 00 

CGAAATCGTG GGCATCAAGC 450 

AGATGTTCCG CAAGCTGCTG 50 0 

ATCTTGCTGC GCGGCACCAA 550 

GGCCAAGCCG GGTTCGATCA 600 

ACATTCTGTC CAAGGAAGAG 650 

TATCGTCCGC AGTTCTACTT 70 0 

CCTGCCGGCG GACAAGGAAA 750 

CCGTCAAGCT GCTGGCCCCG 8 00 

817 



20 



30 



2) INFORMATION FOR SEQ ID NO: 969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 637 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263- 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT , TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

40 GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 200 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 250 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 300 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 350 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

45 TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 450 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 500 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAACCAGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 637 

50 



2) INFORMATION FOR SEQ ID NO: 970 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 4 9997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970 

TATGCCTCAA ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC 50 

10 CATACATCGT TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA 15 0 

CTTCCCAGGC GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC 2 00 

TTGAAGGCGA TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA 2 50 

GTTGACGAAT ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT 3 00 

15 GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 3 50 

CAGGCCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 4 00 

GTTGGTATTG CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT 500 

TGCTACGTGG GGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTAGCT 550 

2 0 AAAGCTGGTA CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT 600 

TTTAACAAAA GAAGAAGGTG GACGTCACAC TCCA 634 



25 2 ) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971 



GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

45 AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA 300 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

50 GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 50 0 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 55 0 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 60 0 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

55 AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA CGTGACAATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 787 



60 2) INFORMATION FOR SEQ ID NO: 972 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 



TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA GCACATCCTG 50 

CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC TGAACAAATG 100 

CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA ATGGAAGTTC 150 

GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC TCCGATCGTT 2 00 

20 CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT GGGAAGCGAA 250 

AAT C CTGGAA CTGGCTGGCT TCCTGGATTC TTACATTCCG GAACCAGAGC 300 

GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT ATTCTCCATC 3 50 

TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG GTATCATCAA 400 

AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT CAGAAGTCTA 45 0 

2 5 CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA AGGCCGTGCT 500 

GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG AAGAAATCGA 550 

ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG CACACCAAGT 600 

TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG CCGTCATACT 650 

CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA CTACTGACGT 700 

3 0 GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA ATGCCGGGCG 750 

ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC GATGGACGAC 800 

GGT 803 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 973 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 762 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 

ACTGACGGCC CGATGCCGCA GACTCGTGAG CACATCCTGC TGGGTCGTCA 50 

GGTAGGCGTT CCGTACATCA TCGTGTTCCT GAACAAATGC GACATGGTTG 100 

ATGACGAAGA GCTGCTGGAA CTGGTTGAAA TGGAAGTTCG TGAACTTGTG 150 

55 TCTCAGTACG ACTTCCCGGG CGACGACACT CCGATCGTTC GTGGTTCTGC 200 

TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG GGAAGCGAAA ATCCTGGAAC 250 

TGGCTGGCTT CCTGGATTCC TACATTCCGG AACCAGAGCG TGCRATTGAC 300 

AAGCCGTTCC TGCTGCCGAT CGAAGACGTA TTCTCCATCT CCGGTCGTGG 350 

TACCGTTGTT ACCGGTCGTG TAGAACGCGG TATCATCAAA GTTGGTGAAG 400 

60 AAGTTGAAAT CGTTGGTATC AAAGAGACTC AGAAGTCTAC CTGTACTGGC 450 

541 
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GTTGAAATGT TCCGCAAACT GCTGGACGAA 
AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 
TACTGGCTAA GCCGGGCACC ATCAAGCCGC 
GTGTACATTC TGTCCAAAGA TGAAGGCGGC 
5 AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 
TTGAACTGCC GGAAGGCGTA GAGATGGTAA 
ATGGTTGTTA CC 



PCT/CA00/01150 

GGCCGTGCTG GTGAGAACGT 500 

AGAAATCGAA CGTGGTCAGG 550 

ACACCAAGTT CGAATCTGAA 600 

CGTCATACTC CGTTCTTCAA 650 

TACTGACGTG ACTGGTACCA 7 00 

TGCCGGGCGA CAACATCAAA 750 

762 



10 



15 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 974 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia, coli 

(B) STRAIN: ATCC 11775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 



CCTGGTAGTT 
TCCTGCTGGG 
AAATGCGACA 

30 AGTTCGTGAA 
-TCGTTCGTGG 
GCGAAAATCC 
AGAGCGTGCG 
CCATCTCCGG 

35 ATCAAAGTTG 
GTCTACCTGT 
GTGCTGGTGA 
ATCGAACGTG 
CAAGTTCGAA 

40 ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGA 



GCTGCGACTG 
TCGTCAGGTA 
TGGTTGATGA 
CTTCTGTCTC 
TTCTGCTCTG 
TGGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GTACCATCGA 
ATCAAAATGG 



ACGGCCCGAT 
GGCGTTCCGT 
CGAAGAGCTG 
AGTACGACTT 
AAAGCGCTGG 
TGGCTTCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GCCGCAGACT 
ACATCATCGT 
CTGGAACTGG 
CCCGGGCGAC 
AAGGCGACGC 
GATTCCTACA 
GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GCGGTATCAA 
GGCACCATCA 
CAAAGATGAA 
AGTTCTACTT 
GGCGTAGAGA 
GATCCACCCG 



CGTGAGCACA 50 

GTTCCTGAAC 100 

TTGAAATGGA 150 

GACACTCCGA 2 00 

AGAGTGGGAA 250 

TTCCGGAACC 300 

GACGTATTCT 350 

ACGCGGTATC 400 

AGACTCAGAA 450 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 600 

GGCGGTCGTC 650 

CCGTACTACT 700 

TGGTAATGCC 750 

ATCGCGATGG 800 

804 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia, coli 

(B) STRAIN: ATCC 25922 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975 



GCGATCCTGG 
GCACATCCTG 
TGAACAAATG 
ATGGAAGTTC 
TCCGATCGTT 
GGGAAGCGAA 
GAACCAGAGC 
ATTCTCCATC 
GTATCATCAA 
CAGAAGTCTA 
AGGCCGTGCT 
AAGAAATCGA 
CACACCAAGT 
TCGTCATACT 
CTACTGACGT 
ATGCCGGGCG 
GATG 



TAGTTGCTGC 
CTGGGTCGTC 
CGACATGGTT 
GTGAACTTCT 
CGTGGTTCTG 
AATCCTGGAA 
GTGCGATTGA 
TCCGGTCGTG 
AGTTGGTGAA 
CCTGTACTGG 
GGTGAGAACG 
ACGTGGTCAG 
TCGAATCTGA 
CCGTTCTTCA 
GACTGGTACC 
ACAACATCAA 



GACTGACGGC CCGATGCCGC AGACTCGTGA 
AGGTAGGCGT TCCGTACATC ATCGTGTTCC 
GATGACGAAG AGCTGCTGGA ACTGGTTGAA 
GTCTCAGTAC GACTTCCCGG GCGACGACAC 
CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 
CTGGCTGGCT TCCTGGATTC YTACATTCCG 
CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 
GTACCGTTGT TACCGGTCGT GTAGAACGCG 
GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 
CGTTGAAATG TTCCGCAAAC TGCTGGACGA 
TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 
GTACTGGCTA AGCCGGGCAC CATCAAGCCG 
AGTGTACATT CTGTCCAAAG ATGAAGGCGG 
AAGGCTACCG TCCGCAGTTC TACTTCCGTA 
ATCGAACTGC CGGAAGGTGT AGAGATGGTA 
AATGGTTGTT ACCCTGATCC ACCCGATCGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
804 



2) INFORMATION FOR SEQ ID NO: 976 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 200 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 250 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 3 00 

GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 350 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 400 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA ACGTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 60 0 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACGC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

CGCCATGGAC GACGGTSTGC GGTTC 825 



2) INFORMATION FOR SEQ ID NO: 977 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 

15 TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAAGTTG ACTTGGTTGA CGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTATTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 200 

CAGTTATCCA AGGTTCAGCA CTTAAAGCTC TTGAAGGTGA CTCTAAATAC 250 

20 GAAGACATCG TTATGGAATT GATGAACACA GTTGATGAGT ATATCCCAGA 300 

ACCAGAACGT GACACTGACA AACCATTGCT TCTTCCAGTC GAGGACGTAT 350 

TCTCAATCAC TGGACGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ATCGTTAAAG TCAACGACGA AATCGAAATT GTTGGTATCA AAGAAGAAAC 450 

TCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAACTTGACG 500 

25 AAGGTCTTGC TGGAGATAAC GTAGGTGTCC TTCTTCGTGG TGTTCAACGT 550 

GATGAAATCG AACGTGGACA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAAGGTG AAGTCTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAATT CTACTTCCGT 700 

ACTACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

30 AATGCCTGGT GATAACGTGA CAATCGACGT TGAGTTGATT CACCCAATCG 800 

CCGTAGAACA AGGTACTACA 82 0 



35 2) INFORMATION FOR SEQ ID NO: 978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978 



GGCGCGATCC TGGTGGTCGC CGCCACCGAT GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTGGG CGTGCCCTAC ATCCTGGTGG 100 

CGCTGAACAA GTCCGACGCG GTCGACGACG AGGAGCTGCT CGAGCTCGTC 150 

GAGCTGGAGG TCCGCGAGTT GCTGGCCGCC CAGGACTTCG ACGAGGAAGC 200 

55 TCCGGTGGTC CGGGTCTCGG CGCTGAAGGC GCTCGAGGGC GACGCCACCT 250 

GGGTGAAGTC GGTAGAGGAC TTGATGGACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGACACCGA CAAGCCGTTC CTGATGCCCG TCGAGGACGT 350 

CTTCACCATC ACCGGTCGTG GCACCGTCGT CACCGGCCGT GTGGAGCGCG 4 00 

GCGTGGTGAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CAAGCCGACC 450 

60. AGCACCAAGA CCACGGTCAC CGGTGTGGAG ATGTTCCGGA AGCTGCTCGA 500 

544 
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CCAGGGTCAG GCCGGTGACA ACGTCGGTCT 
GTGAGGACGT CGAGCGCGGC CAGGTCGTCA 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
5 GCACCACCGA CGTGACCGGT GTGGTGACGC 
GTGATGCCCG GTGACAACAC CAACATCTCG 
CGCCATGGAC GACGGTCTGC GG 



PCT/CA00/01150 

GCTGCTGCGT GGTGTCAAGC 55 0 

TCAAGCCCGG CACCACCACT 600 

ATCCTGTCCA AGGACGAGGG 650 

CCGTCCGCAG TTCTACTTCC 700 

TGCCGGAGGG CACCGAAATG 750 

GTGAAGCTGA TCCAGCCCGT 800 

822 



10 



20 



45 



2) INFORMATION FOR SEQ ID NO: 979 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 821 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

3 0 TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC ACTTAAAGCT CTTGAAGGTG ACTCTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TATATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAGGACGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

35 TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 50 0 

GAAGGTCTTG CTGGAGATAA CGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATCGC TAAACCAGGT TCAATCAACC 60 0 

CACACACTAA ATTCAAAGGT GAAGTCTACA TCCTTACTAA AGAAGAAGGT 650 

4 0 GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT TCACCCAATC 80 0 

GCCGTAGAAC AAGGTACTAC A 821 



2) INFORMATION FOR SEQ ID NO: 980 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: ATCC 25177 

60 

545 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980 



10 



15 



20 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 



TGGTGGTCGC 
CTGCTGGCGC 
GGCCGACGCA 
TCCGCGAGCT 
CGGGTCTCGG 
TGTCGAGGAA 
GCGAGACCGA 
ACCGGCCGCG 
CGTGAACGAG 
CCACCGTCAC 
GCGGGCGACA 
CGAGCGTGGC 
AGTTCGAAGG 
ACGCCGTTCT 
CGTGAC CGGT 
GTGACAACAC 
GAAGGTNTGC 



CGCCACCGAC 
GTCAAGTGGG 
GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 
GAACCGTGGT 
GAAGTTGAGA 
CGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 
CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCG 



GGCCCGATGC 
TGTGCCCTAC 
AGGAGCTGCT 
CAGGAATTCG 
GCTCGAGGGT 
CGGTCGACGA 
CTGATGCCGG 
CACCGGACGT 
TCGTCGGCAT 
ATGTTCCGCA 
GCTGCTGCGG 
CCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGTTGA 



PCT/CA00/01150 



CCCAGACCCG 50 

ATCCTGGTAG 10 0 

CGAACTCGTC 150 

ACGAGGACGC 200 

GACGCGAAGT 250 

GTCGATTCCG 3 00 

TCGAGGACGT 350 

GTGGAGCGCG 400 

TCGCCCATCG 45 0 

AGCTGCTCGA 500 

GGCGTCAAGC 550 

CACCACCACG 600 

AGGACGAGGG 650 

TTCTACTTCC 700 

CACCGAGATG 750 

TCCAGCCCGT 80 0 

828 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 981 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 144 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 



CGGCGGTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ATGTACCAGT 
GAATACGAAC 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
AGAAAGCAAG 
TAGACTACGC 
GCACGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTTATG 
CAATCGCGAT 



TTAGTAGTAT 
CTTATTATCA 
AAGCTGACAT 
GTTCGTGACT 
TATCGTTGGT 
AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
ACGTACAACG 
ACAAAATTCA 
TCATACTCCA 
CTGACGTAAC 
CCTGGCGATA 
TGAAGACGG 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAGAAAT 



TGGCCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 50 

TTTAGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGACG 200 

AGGCGACCCA 250 

ATGACTACAT 3 00 

CCAGTTGAGG 350 

CCGTGTTGAA 400 

GTATCACTGA 450 

CGTAAATTAT 500 

ACGTGGTGTT 550 

CTGGCTCTAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 750 

TTAATCGCTC 800 

819 



60 



2) INFORMATION FOR SEQ ID NO: 982 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982 

CTATCCTTGT AGTAGCTTCA AC TG ACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAGATC GACTTGGTTG ATGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTCTTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCT CTTGAAGGTG ATACTAAGTA 250 

CGAAGACATC ATCATGGAAT TGATGAACAC TGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGATACTGAC AAAC CTCTTC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGATG AAATCGAAAT CGTTGGTATC AAA.GAAGAAA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAGCTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTAGGTGTA CTTCTTCGTG GTATCCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAGGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAGT TCTACTTCCG 700 

TACAACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTA ACTATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTTGAAC AAGG 814 



2) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 983 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

547 
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GAAGGTCTTG CCGGAGATAA TGTAGGTGTC 
TGATGAAATC GAACGTGGAC AAGTTATTGC 
CACACACTAA ATTCAAAGGT GAAGTTTACA 
GGACGTCACA CTCCATTCTT CAACAACTAC 
5 TACTACTGAC GTTACAGGTT CAATCGAACT 
TAATGCCTGG TGATAACGTG ACAATCGACG 
GCCGTAGAAC 



PCT/CA00/01150 

CTTCTTCGTG GTGTTCAACG 550 

TAAACCAGGT TCAATCAACC 600 

TCCTTACTAA AGAAGAAGGT 650 

CGTCCACAAT TCTACTTCCG 700 

TCCAGCAGGT ACTGAAATGG 750 

TTGAGTTGAT CCACCCAATC 800 

810 



10 



20 



45 



60 



2) INFORMATION FOR SEQ ID NO: 984 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 10 0 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

30 AAATC CGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGCTCTT GAAGGTGATA CTAAGTACGA 250 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 3 00 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AGACGTATTC 350 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG ACCGTGGTAC 400 

3 5 TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 450 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TCCGTAAACA GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGTACTT CTTCGTGGTA TCCAACGTGA 550 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA ACCAGGTTCA ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

40 CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT ACTTCCGTAC 700 

AACTGACGTT ACAGGTTCAA TCGAACTTCC AGCAGGTACT GAAATGGTAA 750 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 800 

GTTGAACAAG GTACTAC 817 



2) INFORMATION FOR SEQ ID NO: 985 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY; Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 3 503 7 



548 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 985 

GACGGACCAA TGCCACAAAC TCGTGAGCAC ATCCTTCTTT CACGTCAGGT 50 

TGGTGTTAAA CACCTTATTG TCTTCATGAA CAAAATTGAC TTGGTAGACG 100 

5 ACGAAGAATT GCTTGAATTG GTTGAAATGG AAATCCGTGA CCTCTTGTCA 150 

GAATACGACT TCCCAGGTGA CGATCTTCCA GTTATCCAAG GTTCAGCTCT 2 00 

TAAAGCTCTT GAAGGTGACT CTAAATACGA AGACATCATT ATGGAATTGA 250 

TGAACACTGT TGATGAGTAC ATCCCAGAAC CAGAACGTGA CACTGAAAAA 3 00 

CCATTGCTTC TTCCAGTCGA AGACGTATTC TCAATCACTG GACGTGGTAC 350 

10 AGTTGCTTCA GGACGTATCG ACCGTGGTAC TGTTCGTGTC AACGACGAAA 4 00 

TCGAAATCGT TGGTATCAAA GAAGAAACTC AAAAAGCAGT TGTTACTGGT 450 

GTTGAAATGT TCCGTAAACA ACTTGACGAA GGTCTTGCCG GAGATAACGT 500 

AGGTGTCCTT CTTCGTGGTG TTCAACGTGA CGAAATCGAA CGTGGACAAG 550 

TTATCGCTAA ACCAGGTTCA ATCAACCCAC ACACTAAATT TAAAGGTGAA 60 0 

15 GTCTACATCC TTACTAAAGA AGAAGGTGGA CGTCACACTC CATTCTTCAA 65 0 

CAACTACCGT CCACAATTCT ACTTCCGTAC TACTGACGTT ACAGGTTCAA 700 

TCGAACTTCC TGCAGGTACT GAAATGGTAA TGCCTGGTGA TAACGTGACT 750 

ATCGACGTTG AGTTGATCCA CCCAATCGCC GTAGAACAAG GTACTACA 798 



20 



2 ) INFORMATION FOR SEQ ID NO: 986 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986 



TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

40 AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT 200 

ATCCAAGGTT CAGCACTTAA AGCTCTTGAA GGTGACTCTA AATACGAAGA 250 

CATCGTTATG GAATTGATGA ACACAGTTGA TGAGTATATT CCAGAACCAG 3 00 

AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA CGTATTCTCA 350 

4 5 ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 4 00 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCTGGAG ATAACGTAGG TGTCCTTCTT CGTGGTGTTC AACGTGATGA 550 

AATCGAACGT GGACAAGTTA TCGCTAAACC AGGTTCAATC AACCCACACA 600 

50 CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 700 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA 800 

GAACAAGGTA CTACA 815 

55 



2) INFORMATION FOR SEQ ID NO: 987 

60 (i) SEQUENCE CHARACTERISTICS: 

549 
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WO 01/23604 PCT/CA00/01 150 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 
10 (B) STRAIN: ATCC 8043 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987 

CGGACAATCT TGGTTGTTTC TGCTACGGAT GGCCCAATGC CTCAAACTCG 50 

15 TGAACACATT TTACTTTCTC GCCAAGTAGG CGTGAAATAT TTGATTGTTT 100 

TCTTGAACAA AACAGATTTA GTCGATGATG AAGAATTAAT TGATCTAGTA 150 

GAAATGGAAG TTCGTGAACT ATTAAGCGAA TATGGATTTC CAGGTGATGA 200 

TACACCTGTT ATCAAAGGAT CAGCATTAAA AGCATTACAA GGTGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ACACAGTCGA TGAATACATC 300 

20 CCAACACCAG AACGTGATAC GGACAAACCA TTATTGTTAC CCGCCGAAGA 350 

TGTATTTTCA ATTACTGGAC GTGGAACAGT AGCATCTGGT CGGATTGACC 400 

GTGGGGCTGT TAGAGTCGGT GACGAAATCG AAATCGTAGG GATCAAACCA 450 

GAAACGCAAA GAGCCGTAGT AACAGGAGTT GAAATGTTCC GCAAAACGCT 500 

TGATTACGGT GAAGCAGGGG ATAACGTAGG TGTGTTATTA CGTGGGATTC 550 

25 AAAGAGAAGA CATCGAACGT GGCCAAGTGA TTGCCAAACC TGGTTCAATT 600 

ACACCTCATA CTAAATTCAA AGCAGAAGTT TAGGTTTTGA CTAAAGAAGA 650 

AGGCGGACGT CATACACCAT TCTTCAATAA TTATCGACCA CAATTTTATT 700 

TCCGCACAAC TGACGTAACA GGAACAATTG TTTTGCCAGA AGGAACGGAA 750 

ATGGTCATGC CTGGCGACAA CGTAACGATC GATGTAGAAT TGATTCATCC 800 

30 TGTTGCTATT GAAAACGGGA CGACTTTCTC CA 832 



35 



2 ) INFORMATION FOR SEQ ID NO: 988 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 50 

GCGAGCATAT CCTTTTATCA CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA ATATGGTTTC CCAGGTGATG 200 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 250 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 300 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 350 

ATGTTTTCTC AATCACTGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 400 

CGTGGAGCCG TTCGTGTCGG TGATGAAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 

550 
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CAAAGAGAAG ATATTGAACG TGGACAAGTA 
CACACCACAT ACAAAATTCA AAGCGGAAGT 
AAGGCGGACG TCATACACCA TTCTTCAATA 
TTCCGCACAA CAGATGTAAC AGGTACGATC 
5 AATGGTCATG CCTGGAGACA ACGTAACCAT 
CAGTGGCAAT CGAACAAGGA ACGACTTTCT 



ATTGCGAAAC CTGGTTCAAT 600 

TTATGTATTG ACGAAAGAAG 650 

ACTACCGCCC ACAATTTTAT 700 

GTGTTGCCAG AAGGAACAGA 750 

CGAGGTAGAG TTGATCCATC 800 

CTATT 835 



10 2) INFORMATION FOR SEQ ID NO: 989 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 4 942 7 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989 



TGCGATCTTA GTAGTATCTA CTACTGATGG ACCAATGCCT CAAACACGTG 50 

AGCACATTTT GTTATCACGC CAAGTAGGTG TTAAATACTT GATCGTCTTC 100 

TTGAACAAAG TTGATTTAGT CGACGATGAA GAATTGATTG ATTTAGTTGA 150 

AATGGAAGTA CGTGAGTTAC TTTCAGAATA TGGTTTCCCA GGCGATGATA 2 00 

30 TTCCTGTTCT TAAAGGTTCA GCTCTGAAAG CTTTAGAAGG CGATCCTGAA 250 

CAAGAACAAG TAATCATGGA CCTAATGGAT ACGGTTGACG AATATATCCC 3 00 

AACACCAGAA CGTGACACTG ACAAACCATT CTTGTTACCA GTGGAAGATG 350 

TTTTCTCAAT CACAGGACGT GGGACTGTTG CATCTGGTCG TATTGATCGT 400 

GGGGAAGTTA AAGTCGGTGA CGAAGTTGAA ATTATCGGGA TCAAACCTGA 450 

35 AGTTCAAAAG GCTGTCGTAA CTGGACTTGA AATGTTCCGT AAAACATTGG 500 

ATTATGGTGA AGCTGGAGAT AACGTTGGGG TTCTATTACG TGGTATTACA 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCTAAACCAG GTTCAATTAC 600 

ACCACATACG AAATTCAGTG CAGAAGTTTA TGTGTTGACG AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC TTTAACAACT ATCGTCCTCA ATTCTACTTC 700 

40 CGTACAACAG ACGTTACCGG TAATATCGTG TTGCCAGAAG GTACTGAAAT 750 

GGTCATGCCT GGCGATAACG TAACAATCGA CGTTGAATTA ATCCATCCAA 800 

TCGCCGTAGA AAAAGGAACA ACTTTCTCTA TT 832 



45 



55 



2) INFORMATION FOR SEQ ID NO: 990 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 154 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990 

551 
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GGATCCTGTA TATGCACAAA AACTAGGTGT 
TATCACAACC TGATACAGGG GAGCAAGGTT 
GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT GAATTACTAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 



10 



2 ) INFORMATION FOR SEQ ID NO: 991 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991 

25 GCCATTGCAG AGGCACAGAA GCAGGGCGGT ATTGCAGCCT TCATTGATGC 50 

TGAGCACGCC TTCGACCGTT TCTATGCAGA GAAGTTAGGT GTGGATGTTG 100 

ATAACCTTTG GGTTTCACAG CCAGACAATG GTGAGCAGGC TTTAGAGATT 150 

GCCGACCAGC TGATTCGCTC TTCCGCTATT GACATTCTCG TTGTCGACTC 200 

AGTTGCAGCC TTGACTCCAA AGAAGGAGAT TGAGGGTGAC ATGGGTGACT 250 

30 CTGCAGTAGG TTTACAAGCA CGACTGATGA GTCAGGCATT GCGTAAACTT 300 

ACCTCAACAA TCGCAAAAAC TAATACTTGC TGCATCTTCA TCAACCAGTT 350 

GCGTGAGAAG ATTGGTGTGA TGTTTGGTAA TCCA 384 



35 



45 



2) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA TTACTAGAAT 100 

TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 200 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 250 

ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT GATGCCAGTC 3 00 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 350 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 400 

60 CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT GTTCCGTAAA 450 
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TTGTTAGACT ATGCTQAAGC AGGGGATAAC 
TGTTGCTCGT GAAGACATCC AACGTGGACA 
CAATCACACC TCATACAAAA TTTAAAGCTG 
GAAGAAGGTG GACGTCATAC ACCA 



ATTGGTGCAT TGCTACGTGG 500 
AGTATTGGCT AAAGCTGGTA 550 
AAGTTTACGT TTTAACAAAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 993 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 100 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 250 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 3 00 

TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 350 

TTATTTATGT GAAAATGCAG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 400 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTCGTTAAA 450 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAATATCTTA 550 

CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 650 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 700 

ATGAAACGCT TGACTCAAAC ACAAGCCAAA TTGAAGTCTA CCTAACAACC 750 

AAGTAA 756 



2) INFORMATION FOR SEQ ID NO: 994 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
TGGACTAACA ATCTCGCAAG AGG 



2) INFORMATION FOR SEQ ID NO: 995 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 
ACATTCTCGT GAGTAACAGG GT 



2 ) INFORMATION FOR SEQ ID NO: 996 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 
ACAAATCATG AAGGGAATCA TTTAG 



2 ) INFORMATION FOR SEQ ID NO: 997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 



2) INFORMATION FOR SEQ ID NO: 998 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 



554 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10- 



20 



2) INFORMATION FOR SEQ ID NO: 999 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG 23 



2) INFORMATION FOR SEQ ID NO: 1000 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Single 

25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000 
TTAGTGTGTG GGTTGATTGA ACT -« 



35 2) INFORMATION FOR SEQ ID NO: 1001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001 

AAGAGTTGCT TGAATTAGTT GAG 23 



50 

2) INFORMATION FOR SEQ ID NO: 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : * Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

555 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Streptococcus pyogenes 

(B) STRAIN: ATCC 7002 94 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 02 

AACATGATCA CTGGTGCCGC TCAAATGGAC GGAGCTATCC TTGTAGTTGC 50 

TTCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC 100 

GTCAGGTTGG TGTTAAACAC CTTATCGTGT TCATGAACAA AGTTGACCTT 150 

10 GTTGATGACG AAGAGTTGCT TGAATTAGTT GAGATGGAAA TTCGTGACCT 2 00 

TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTTCCAGTT ATCCAAGGTT 250 

CAGCTCTTAA AGCTCTTGAA GGCGACACTA AATTTGAAGA CATCATCATG 3 00 

GAATTGATGG ATACTGTTGA TTCATACATT CCAGAACCAG AACGCGACAC 350 

TGACAAACCA TTGCTTCTTC CAGTCGAAGA CGTATTCTCA ATTACAGGTC 400 

15 GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTACTGT TCGTGTCAAC 450 

GACGAAATCG AAAT CGTTGG TATCAAAGAA GAAACTAAAA AAGCTGTTGT 500 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCAGGAG 550 

ACAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA AATCGAACGT 600 

GGTCAAGTTA TTGCTAAACC AAGTTCAATC AACCCACACA CTAAATTCAA 650 

2 0 AGGTGAAGTA TATATCCTTT CTAAAGACGA AGGTGGACGT CACACTCCAT 700 

TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACAAC TGACGTAACA 750 

GGTTCAATCG AACTTCCAGC AGGTACAGAA ATGGTTATGC CTGGTGATAA 800 

CGTGACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA GAACAAGGTA 850 

CTACTTTCTC AATC CGTGAA GGTGGACGTA CTGTTGGTTC AGGT 894 

25 



2) INFORMATION FOR SEQ ID NO: 1003 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
40 (B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 50 

45 TATTATCACA GCCTGATACA GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CAAAAACAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

50 AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 332 



2 ) INFORMATION FOR SEQ ID NO: 1004 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 

10 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

15 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

20 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

25 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AAC CACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

30 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 115 0 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 

CAAATACAAC CC 1212 

35 

2 ) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

557 
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CAATATGCTC TTCAACAATC ACGAAATGTC 
TAAGGTCGGT CTAGATAGAG CTAAAACCTT 
ACTATCCAAG CATGCATTAT GCAAACGCCA 
TCTAATAAAC AATACGGAGC AAGTAGTGAA 
TGCCTTTGCC AACGGTGGAA TTTACCACAA 
TCGTCTTCAG TGACGGTAGT AAAAAAGAAT 
GCTATGAAAG AAACAACTGC TTACATGATG 
CTTGGCATAC GGAACTGGTC GTGGAGCCTA 
CTGGTAAGAC AGGTACTTCT AACTACACAG 
ATCAAGAACA CTGGCTATGT AGCTCCAGAT 
TCGTAAGTAT TCTATGGCTG TATGGACAGG 
CTATCGTTGG AGATGGTTTC CTAGTTGCAG 
ATAACGTATC TGTCTGAAGG AAGCAATCCA 
GGGGCTCTAC AGAAATGGAG AATTCGTATT 
CGTGGAGCTC ACCTGCTGCA CAACAACCCC 
TCATCATCAG ATAGTTCAAC TTCACAGTCT 
AAATAATAGT ACGACTACCA ATCCTAACAA 
CAACCCCTGA TC 



ACAGCCGTTG AGACTTTGAA 400 

CCTTAATGGT CTTGGTATCG 450 

TTTCAAGTAA TACAACAGAA 500 

AAAATGGCTG CTGCTTATGC 550 

ACCCATGTAT ATCAATAAGG 60 0 

TTTCAGATGT AGGTACACGA 650 

ACCGAAATGA TGAAAACTGT 700 

TCTCCCATGG TTAGCGCAAG 75 0 

ATGATGAAAT TGAAAAACAC 8 00 

GAAATGTTTG TTGGTTATAC 850 

TTATTCGAAT CGTTTAACTC 900 

CTAAAGTTTA TCGCTCAATG 950 

GAGGATTGGA ATATACCAGA 1000 

TAAAAATGGT GCTCGTTCTA 1050 

CATCAACTGA AAGTTCAAGC 110 0 

AACTCAACCA CTCCAAGCAC 1150 

TAATACGCAA CAATCAAATA 1200 

1212 



2) INFORMATION FOR SEQ ID NO : 1006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 



CGATCAATAC 


50 


TCGTAGATGT 


100 


CAAGCAAGTA 


150 


TGACTGGGGT 


200 


AATACGGTGT 


250 


AACTATCCGG 


300 


CGGTAATATT 


350 


CCGTTGAGAC 


400 


AATGGTCTTG 


450 


AAGTAATACA 


500 


TGGCTGCTGC 


550 


ATGTATATCA 


600 


AGATGTAGGT 


650 


AAATGATGAA 


700 


CCATGGTTAG 


750 


TGAAATTGAA 


800 


TGTTTGTTGG 


850 


TCGAATCGTT 


900 


AGTTTATCGC 


950 


ATTGGAATAT 


1000 


AATGGTGCTC 


1050 


AACTGAAAGT 


1100 


CAACCACTCC 


1150 
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CAAATACAAC CCC 1213 



2 ) INFORMATION FOR SEQ ID NO: 1007 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 



AC CAAGAAGC 
GTCTCTTACC 
TTCAAATGGT 

25 ACGTTTCATT 
TCTGCTATGA 
TTATGATTCC 
GAACAAGCAC 
ACTCTGCAAT 

3 0 TTTGAATAAG 
GTATCGACTA 
ACAGAATCTA 
TTATGCTGCC 
ATAAAGTCGT 

35 ACTCGTGCCA 
AACAGTCTTG 
CTCAAGCTGG 
AACCACATCA 
TTATACTCGT 

40 TAACTCCTAT 
TCAATGATAA 
GCCAGACGGA 
GCCCAATATG 
TCAAGCTCAT 

45 AAGCACAAAT 
CAAATACAAC 



TCAAAAACAT 

CTGACGATGA 

AAAGTCATCG 

TGGTACCAAC 

AACCAATCAC 

ACTGCAACTA 

ACCTGTCTAC 

ATGCTCTTCA 

GTCGGTCTAG 

TCCAAGCATG 

ATAAACAATA 

TTTGCAAATG 

CTTCAGTGAT 

TGAAGGAAAC 

ACTTATGGAA 

TAAAACAGGA 

AGAACACTGG 

AAGTATTCTA 

CGTTGGAGAT 

CGTATCTATC 

CTTTTCAGAA 

GACTGAACCC 

CATCAGATAG 

AATAGTACGA 

CCCTGATC 



CTGTGGGATA 
TTTGCAAGTC 
CCCAACTTGG 
CAAGCTGTGG 
CGATTATGCA 
TGGTTAATGA 
AACTGGGATA 
ACAATCACGA 
ATAGAGCTAA 
CATTATGCAA 
CGGAGCAAGT 
GTGGCACTTA 
GGAAGTAAAA 
GACAGCCTAT 
CTGGGCGTGG 
ACCTCTAACT 
CTATGTAGCT 
TGGCTGTATG 
GGTTTCCTAG 
AGAAGATACT 
ACGGGGAATT 
TCTACTCAAC 
TTCAACTTCA 
CTACCAATCC 



TCTACAACTC 
GCATCTACGG 
AGCTCGTCAC 
AAACCAATCG 
CCTGCCATAG 
TATTCCTTAT 
GAGCATATTT 
AATGTCACAG 
AACCTTCCTT 
ACGCCATTTC 
AGTGAAAAAA 
CTATAAACCA 
AAGAGTTCTC 
ATGATGACCG 
AGCCTATCTT 
ATACAGATGA 
CCAGATGAAA 
GACAGGTTAT 
TTGCAGCTAA 
CATCCAGAAG 
TGTATTCAAA 
AATCCTCAAC 
CAGTCTAGCT 
TAACAATAAT 



CGATCAATAC 50 

TCGTAGATGT 100 

CAAGCAAGTA 150 

TGACTGGGGT 200 

AATACGGTGT 250 

AACTATCCGG 3 00 

CGGTAATATT 350 

CCGTTGAGAC 400 

AATGGTCTTG 450 

AAGTAATACA 500 

TGGCTGCTGC 550 

ATGTATATCC 600 

TAATGTCGGA 650 

ACATGATGAA 700 

CCTTGGCTTC 750 

GGAAGTTGAA 800 

TGTTTGTTGG 850 

TCGAATCGTT 900 

AGTTTATCGC 950 

ACTGGACGAT 1000 

AATGGAGCTC 1050 

AGCTGAAAGT 1100 

CAACCACTCC 1150 

ACGCAACAAT 1200 
1218 



50 2) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

5 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 200 

10 GGGTTCTGCT ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT 300 

CCGGGAACAA GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA 350 

TATTACTCTG CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 400 

AGACTTTGAA ' TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

15 CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 650 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 700 

2 0 TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 750 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT 800 

TGAAAACCAC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

25 TCGCTCAATG ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

30 CAATCAAATA CAACCCCTGA TCA 1223 



2) INFORMATION FOR SEQ ID NO: 1009 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

50 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 2 00 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

55 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAAT CACG A AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

60 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 
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TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GAGAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 12 00 

CAAATACAAC CCCT ^ 12 i4 

2 ) INFORMATION FOR SEQ ID NO: 1010 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

3 0 (B) STRAIN: StrR-07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 

CCAAGAAGCT CAAAAACATC TGTGGGATAT CTACAACTCC GATCAATACG 50 

35 TCTCTTACCC TGACGATGAT TTGCAAGTCG CATCTACGGT CGTAGATGTT 100 

TCAAATGGTA AAGTCATCGC ACAACTTGGT GCTCGTCATC AAGCAAGTAA 150 

TGTTTCATTC GGTACCAACC AGGCCGTAGA AACCAATCGT GACTGGGGAT 2 00 

CATCAATGAA ACCAATCACT GACTATGCTC CCGCTTTAGA ATATGGAGTC 250 

TATGACTCTA CTGCTTCTAT TGTACATGAT GTCCCTTATA ACTATCCTGG 300 

4 0 CACTGATACT CCACTCTACA ACTGGGATCA TGTCTACTTT GGAAACATTA 350 

CAATCCAGTA TGCTCTTCAA CAATCACGAA ATGTCACAGC CGTTGAGACT 400 

TTGAATAAGG TCGGTCTAGA TAGAGCTAAA ACCTTCCTTA ATGGTCTTGG 450 

TATCGACTAT CCAAGCATGC ATTATGCAAA CGCCATTTCA AGTAACACAA 500 

CTGAATCCAA CAAAAAATAT GGTGCAAGTA GTGAAAAAAT GGCTGCTGCC 550 

45 TACGCTGCTT TTGCTAATGG TGGTATTTAT CACAAACCAA TGTATATCAA 600 

TAAAATCGTC TTTAGTGATG GTAGCGAAAA AGAATTTTCT GATGCTGGTA 650 

CACGAGCTAT GAAAGAGACT ACTGCCTATA TGATGACTGA AATGATGAAA 700 

ACTGTTTTAA CTTACGGAAC AGGACGTGGA GCCTACCTAC CATGGCTTCC 750 

ACAAGCAGGT AAGACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA 800 

50 AGTATATCAA GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG 850 

* TATACCCGTA AATATGCAAT GGCTGTTTGG ACAGGATACT CAAATCGTCT 900 

AACTCCAATC ATCGGAGATG GTTTCCTTGT TGCTGGTAAA GTCTATCGTT 950 

CAATGATAAC TTACCTTTCT GAAGATGACC AACCTGGAGA TTGGACAATG 1000 

CCAGATGGCT TGTATAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1050 

55 TTCTACGTGG AGCTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT 1100 

CAAGCTCATC ATCAGATAGT TCAACTTCAC AGTCTAACTC AACCACTCCA 1150 

AGCACAAATA ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC 1200 

AAATACAACC CCTGATCAAC AAA 1223 
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2) INFORMATION FOR SEQ ID NO: 1011 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 07 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011 



AAGCTCAAAA ACATCTGTGG 
TACCCTGACG ATGATTTGCA 
TGGTAAAGTC ATCGCCCAAC 
CATTTGGTAC CAACCAAGCT 
ATGAAACCAA TCACCGATTA 
TTCCACTGCA ACTATGGTTA 
GCACACCTGT CTACAACTGG 
CAATATGCTC TTCAACAATC 
TAAGGTCGGT CTAGATAGAG 
ACTATCCAAG CATGCATTAT 
TCTAATAAAC AATACGGAGC 
TGCCTTTGCA AATGGTGGCA 
TCGTCTTCAG TGATGGAAGT 
GCCATGAAGG AAACGACAGC 
CTTGACTTAT GGAACTGGGC 
CTGGTAAAAC AGGAACCTCT 
ATCAAGAACA CTGGCTATGT 
TCGTAAGTAT TCTATGGCTG 
CTATCGTTGG AGATGGTTTC 
ATAACGTATC TATCAGAAGA 
CGGACTTTTC AGAAACGGGG 
TATGGACTGA ACCCTCTACT 
TCATCATCAG ATAGTTCAAC 
AAATAATAGT ACGACTACCA 
CAACCCC 



GATATCTACA ACTCCGATCA 
AGTCGCATCT ACGGTCGTAG 
TTGGAGCTCG TCACCAAGCA 
GTGGAAACCA ATCGTGACTG 
TGCACCTGCC ATAGAATACG 
ATGATATTCC TTATAACTAT 
GATAGAGCAT ATTTCGGTAA 
ACGAAATGTC ACAGCCGTTG 
CTAAAACCTT CCTTAATGGT 
GCAAACGCCA TTTCAAGTAA 
AAGTAGTGAA AAAATGGCTG 
CTTACTATAA ACCAATGTAT 
AAAAAAGAGT TCTCTAATGT 
CTATATGATG ACCGACATGA 
GTGGAGCCTA TCTTCCTTGG 
AACTATACAG ATGAGGAAGT 
AGCTCCAGAT GAAATGTTTG 
TATGGACAGG TTATTCGAAT 
CTAGTTGCAG CTAAAGTTTA 
TACTCATCCA GAAGACTGGA 
AATTTGTATT CAAAAATGGA 
CAACAATCCT CAACAGCTGA 
TTCACAGTCT AGCTCAACCA 
ATCCTAACAA TAATACGCAA 



7\ t j\ r , n r rc tr vr %r r 
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inn 
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GGGTTCTGCT 


200 


GTGTTTATGA 


250 


CCGGGAACAA 


300 


TATTACTCTG 


350 


AGACTTTGAA 


400 


CTTGGTATCG 


450 


TACAACAGAA 


500 


CTGCTTATGC 


550 


ATCCATAAAG 


600 


CGGAACTCGT 


650 


TGAAAACAGT 


700 


CTTCCTCAAG 


750 


TGAAAACCAC 


800 


TTGGTTATAC 


850 


CGTTTAACTC 


900 


TCGCTCAATG 


950 


CGATGCCAGA 


1000 


GCTCGCCCAA 


1050 


AAGTTCAAGC 


1100 


CTCCAAGCAC 


1150 


CAATCAAATA 


1200 




1207 



2 ) INFORMATION FOR SEQ ID NO: 1012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 
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AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

5 CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 30 0 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 40 0 

10 TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 60 0 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

15 GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 70 0 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 8 00 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

2 0 CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGTTCTA 1050 

CGTGGAACTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

25 AAATAATAGT ACGACTACCG ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

C 1201 



30 2) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 0 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC AACTCCGATC 50 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC TACGGTCGTA 100 

GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC GTCATCAAGC 150 

AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC AATCGTGACT 200 

50 GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC TTTAGAATAT 250 

GGAGTCTATG ACTCTACTGC TTCTATTGTA CATGATGTCC CTTATAACTA 300 

1 TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC TACTTTGGAA 350 

y ACATTACAAT CCAGTATGCT CTTCAACAAT CACGAAATGT CACAGCCGTT 4 00 

GAGACTTTGA ATAAGGTCGG TCTAGATAGA GCTAAAACCT TCCTTAATGG 4 50 

55 TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA 500 

ACACAACTGA ATCCAACAAA AAATATGGTG CAAGTAGTGA AAAAATGGCT 550 

GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA AACCAATGTA 600 

TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA TTTTCTGATG 650 

CTGGTACACG AGCTATGAAA GAGACTACTG CCTATATGAT GACTGAAATG 700 

60 ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT ACCTACCATG 750 
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GCTTCCACAA GCAGGTAAGA CAGGTACTTC TAACTATACT GACGAAGAAA 800 

TTGAAAAGTA TATCAAGAAC ACTGGCTACG TAGCTCCAGA TGAAATGTTT 850 

GTGGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG GAAGCAATCC AGAGGATTGG 1000 

AATATACCAG AGGGGCTCTA CAGAAATGGA GAATTCGTAT TTAAAAATGG 1050 

TGCTCGTTCT ACGTGGAGCT CACCTGCTCC ACAACAACCC CCATCAACTG 1100 

AAAGTTCAAG CTCATCATCA GATAGTTCAA CTTCACAGTC TAGCTCAACC 1150 

ACTCCAAGCA CAAATAATAG TACGACTACC AATCCTAACA ATAATACGCA 1200 

10 ACAATCAAAT ACAACCCCTG 1220 



15 



55 



2) INFORMATION FOR SEQ ID NO: 1014 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

30 CAAAAACATC TGTGGGATAT TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TCTAACGGTA 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCCTTC 150 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 200 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 2 50 

35 CTGCTACTAT CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACC 300 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 350 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 400 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 450 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

40 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AGAGTTCTCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA. CATGATGAAA ACTGTCTTAG 700 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 750 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA CCAGAGGGGC 1000 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 1050 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA 1150 

ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



2) INFORMATION FOR SEQ ID NO: 1015 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1015 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

15 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 2 50 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 3 00 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

2 0 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 6 00 

25 TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

30 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGC TCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 10 00 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

35 TCATCATCAG AT AGTT CAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA T 1211 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1016 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13. 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

60 AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 200 
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GGGATCATCA ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG 2 50 

GAGTCTATGA CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT 3 00 

CCTGGCACTG ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA 350 

CATTACAATC CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 400 

5 AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

CACAACTGAA TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG 550 

CTGCCTACGC TGCTTTTGCT AATGGTGGTA TTTATCACAA ACCAATGTAT 60 0 

ATCAATAAAA TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC 650 

10 TGGTACACGA GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA 700 

TGAAAACTGT TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG 750 

CTTCCACAAG CAGGTAAGAC AGGTACTTCT AACTATACTG ACGAAGAAAT 800 

TGAAAAGTAT ATCAAGAACA CTGGCTACGT AGCTCCAGAT GAAATGTTTG 850 

TGGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

15 CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

TCGCTCTATG ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA 1000 

ATATACCAGA GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT 1050 

GCTCGTTCTA CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA 110 0 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

20 CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 12 00 

CAATCAAATA CAACCCCTGA TC 1222 



25 2) INFORMATION FOR SEQ ID NO: 1017 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG 50 

AATACGTTGC CTATCCAGAC GATGAATTGC AAGTCGCTTC T AC CATTGTT 10 0 

GATGTTTCTA ACGGTAAAGT CATTGCCCAG CTAGGAGCAC GCCATCAGTC 150 

AAGTAATGTT TCCTTCGGAA TTAACCAAGC AGTAGAAACA AACCGCGACT 20 0 

45 GGGGATCAAC TATGAAACCG ATCACAGACT ATGCTCCTGC CTTGGAGTAC 2 50 

GGTGTCTACG ATTCAACTGC TACTATCGTT CACGATGAGC CCTATAACTA 3 00 

CCCTGGGACA AATACCCCTG TTTATAACTG GGATAGGGGC TACTTTGGCA 350 

ACATCACCTT GCAATACGCC CTGCAACAAT CGCGAAACGT CCCAGCCGTG 4 00 

GAAACTCTAA ACAAGGTCGG ACTCAACCGC GCCAAGACTT TCCTAAATGG 4 50 

50 TCTCGGAATC GACTACCCAA GTATTCACTA CTCAAATGCC ATTTCAAGTA 500 

ACACAACCGA ATCAGACAAA AAATATGGAG CAAGTAGTGA AAAGATGGCT 550 

GCTGCTTACG CTGCCTTTGC AAATGGTGGA ACTTACTATA AACCAATGTA 60 0 

TATCCATAAA GTCGTCTTTA GTGATGGGAG TGAAAAAGAG TTCTCTAATG 650 

TCGGAACTCG TGCCATGAAA GAAACAACTG CTTACATGAT GACCGAAATG 700 

55 ATGAAAACTG TCCTGGCATA CGGAAGTGGT CGTGGAGCCT ATCTCCCATG 750 

GTTAGCGCAA GCTGGTAAGA CAGGTACTTC TAACTACACA GATGATGAAA 800 

TTGAAAAACA CATCAAGAAC ACTGGCTATG TAGCTCCAGA TGAAATGTTT 850 

GTTGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

60 ATCGCTCAAT GATAACGTAT CTATCAAAAG ATACTCATCC AGAAGACTGG 1000 
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ACGATGCCAG ACGGACTTTT CAGAAACGGG 
AGCTCGTTCT ACGTGGAACT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
5 ACAATCAAAT ACAACCCCTG ATCAACAAA 



GAATTTGTAT TCAAAAATGG 1050 
ACAACAACCC CCATCAACTG 1100 
CTTCACAGTC TAACTCAACC 1150 
AATCCTAACA ATAATACGCA 12 00 

1229 



10 



2) INFORMATION FOR SEQ ID NO: 1018 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1018 

25 ATGTAGACCA AGAGGCTCAA AAACGTCTGT GGGATATCTA CAACTCCGAT 50 

CAATACGTCT CTTACCCTGA CGATGATTTG CAAGTCGCAT CTACGGTCGT 100 

AGATGTTTCA AATGGTAAAG TCATCGCCCA ACTTGGAGCT CGTCACCAAG 150 

CAAGTAACGT TTCATTTGGT ACCAACCAAG CTGTGGAAAC CAACCGTGAC 2 00 

TGGGGATCAA GCATGAAACC AATCACTGAT TATGCCCCAG CCTTAGAATA 250 

30 TGGTGTATAT GATTCCACTG CAACTATGGT TAATGATATT CCTTATAACT 3 00 

ATCCGGGCAC AAGCACACCT GTCTACAACT GGGATCGAGC ATATTTTGGT 350 

AATATTAGCC TGCAATATGC CCTTCAACAA TCTCGTAACG TGCCTGCCGT 4 00 

TGAAACACTA AACAAGGTTG GTTTAGATAG AGCCAAAACT TTCCTAAATG 450 

GTTTGGGAAT TGACTATCCA AGTATTCACT ACTCAAATGC TATTTCAAGT 50 0 

35 AATACAACTG AATCTAGTAA ACAGTACGGG GCAAGCAGTG AGAAAATGGC 550 

TGCGGCTTAC GCTGCATTCG CTAATGGCGG TATTTACCAC AAACCAATGT 600 

ACATCAATAA AGTTGTCTTT AGTGATGGTA GCGAAAAAGA ATTTTCTGAT 650 

GCTGGTACAC GAGCTATGAA AGAGACTACT GCCTATATGA TGACTGAAAT 7 00 

GATGAAAACT GTTTTAACTT ACGGAACAGG ACGTGGAGCC TACCTACCAT 750 

40 GGCTTCCACA AGCAGGTAAG ACAGGTACTT CTAACTATAC TGACGACGAA 800 

ATTGAAAAGT ATATCAAGAA CACTGGCTAC GTAGCTCCAG ATGAAATGTT 850 

TGTGGGTTAT ACTCGTAAGT ATTCTATGGC TGTATGGACT GGATACTCAA 900 

ATCGTTTAAC TCCAATCATT GGAGATGGTT TCCTAGTTGC TGCCAAAGTT 950 

TATCGCTCAA TGATATCGTA TCTATCAGAA GATGACCATC CTGGAGATTG 1000 

45 GACAATGCCT GAGGGAGTAT ACAGAAGTGG AGAATTCGTA TTTAAAAATG 1050 

GTGCTCGTTC TACGTGGAGC TCCCCTGCTC CACAACAACC CCCATCAACT 1100 

GAAAGTTCAA GCTCATCATC AGATAGTTCA ACTTCACAGT CTAGCTTAAC 1150 

CACTCCAAGC ACAAATAATA GTACGACTAC CAATCCTAAC AACAATACGC 12 00 

AACAATCAAA TACAACCCCT GATCA 1225 

50 



2) INFORMATION FOR SEQ ID NO: 1019 

55 - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
5 (B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

10 TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 

TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

15 AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 400 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

20 AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

TTCTTGGTAT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 7 SO 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAACCA GCAATTTGGA 800 

25 AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

3 0 AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 12 00 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 12 50 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 13 00 

35 CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 13 50 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 1400 

GTATCAAAAA TACCATCCAA TGAACTAGAA AGGAAATTA 1439 



40 



50 



2 ) INFORMATION FOR SEQ ID NO : 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 
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TGAAGAGACC 
AATATGGCAA 
' AACAATATCA 
TTTGCTGAAA 
5 ATTCTGAAGG 
TTATCCATGT 
TGATTCCTTG 
AGGCTGCGAC 
CAAACCTTAA 

10 TTCTTGGTAT 
CCTTGGAGTA 
ATGGGCCAGA 
AACAGCTATG 
CTGCGACCGG 

15 GAGTATAGCT 
CTATACGCCC 
GTGTTCGTGT 
AAGGGAGGAC 
TAAGGTCAAT 

2 0 ATCAGGTTGC 
AATGGTGCCT 
TGTGGCAGAT 
CATCTGATAA 
AATCTAACAA 

25 GTATCAAAAA 



TTACAAGGAA 
TATGGAAAGC 
AACTGACCAT 
AGTTATTTCA 
TGTCTATGCA 
CAGGGATCAA 
GGAACGGTAA 
CATCAGCTCA 
CAGATCAGCC 
AAATTGGCAT 
TTCATCCAAT 
CCTATCAACC 
GGAAAACTTC 
AATTGACCTA 
TTGCTAATTA 
ATGCAGTTGG 
GGCTCCTCGT 
TGGGTGACTT 
ATATCCGACT 
CCATGGTACT 
TGGTATCCAT 
GGTCAGCAAG 
TCCCCAAATC 
ATGGTGTAGG 
TACCATCCAA 



AACGCTCGGT 
GTGGACACAA 
TGATTTGGCC 
ATTCCGAGCT 
GTCGCCCTTA 
ACATGACCTG 
CCAATGTCTT 
GGTTGGGAAA 
TATTGTTTTC 
ATGGATCTTT 
GCTTACATGG 
AAATATGTTT 
GTGCGACCTT 
CCAGATGAAT 
CATCACCAAT 
CTCAGTATGT 
ATTGTTGAAG 
GATTCAGCAA 
CCGATATGAG 
AGTGGATTGA 
TAGCGGAAAA 
CAACCAATAC 
GCTGTCGCAG 
ACCTTCCATT 
TGAACTAGAA 



AAAAGAAATC 
TTGAGGAAGG 
TTCCAAGATA 
AGGAAATGGT 
ACCCAAAAAC 
AAAACGGGAG 
TGTCCCAGGT 
ATGGTGTTTT 
CAAGGTTCAG 
TCCTATTACA 
TTCAAACCGC 
GTTGGAACCA 
TGGCGAATAT 
CTACTGGATT 
TCCTTTGGGG 
AGCAACTATT 
GCATTTATGG 
CTGCAACCGA 
CATCTTGCAC 
CAACTGGACG 
ACAGGTACAG 
CAATGCGGTG 
TGGTCTTTCC 
GCGCGTGACA 
AGGAAATTAT 



CATCTGGATA 250 

TAGTAAGGGA 3 00 

GCGTGGATGC 350 

GGAGCTAAAT 4 00 

AGGTGCTGTT 450 

AGTTGACTCC 500 

TCGGTTGTCA 550 

ATCAGGAAAC 600 

CTCCAATTTA 650 

GCTGTGGAAG 700 

TCTTGGAATC 750 
GCAATTTGGA 800 
GGCTTGGGGG 850 
TGTTCCCAAA 900 
AGTTTGATAA 950 

GCAAATAATG 1000 

TAATAATGAT 1050 

CAGAGATGAA 1100 

CAAGGTTTTT 1150 

TGCCTTTTCA 12 00 

CCGAAAGCTA 12 50 

GCCTATGCCC 13 00 

TCATAATACC 13 50 

TTATCAATCT 14 00 

G 1441 



2) INFORMATION FOR SEQ ID NO: 1021 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021 

4 5 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

50 GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

55 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

60 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 
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TGGAAACAGC TATGGGAAAA CTTCGTGCGA 
GGGGCTGCGA CCGGAATTGA CCTACCAGAT 
CAAAGAGTAT AGCTTTGCTA ATTACATCAC 
ATAACTATAC GCCCATGCAG TTGGCTCAGT 
5 AATGGTGTTC GTGTGGCTCC TCGTATTGTT 
TGATAAGGGA GGACTGGGTG ACTTGATTCA 
TGAATAAGGT CAATATATCC GACTCCGATA 
TTTTATCAGG TTGCCCATGG TACTAGTGGA 
TTCAAATGGT GCCTTGGTAT CCATTAGCGG 
10 GCTATGTGGC AGATGGTCAG CAAGCAACCA 
GCCCCATCTG ATAATCCCCA AATCGCTGTC 
TACCAATCTA ACAAATGGTG TAGGACCTTC 



PCT/CA00/01150 

CCTTTGGCGA ATATGGCTTG 850 

GAATCTACTG GATTTGTTCC 900 

CAATTCCTTT GGGCAGTTTG 950 

ATGTAGCAAC TATTGCAAAT 1000 

GAAGGCATTT ATGGTAATAA 1050 

GCAACTGCAA CCGACAGAGA 1100 

TGAGCATCTT GCACCAAGGT 1150 

TTGACAACTG GACGTGCCTT 12 00 

AAAAACAGGT ACAGCCGAAA 12 50 

ATACCAATGC GGTGGCCTAT 13 00 

GCAGTGGTCT TTCCTCATAA 13 50 

CATTGCGCGT GACATT 13 96 



15 



2) INFORMATION FOR SEQ ID NO: 1022 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1428 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: streptococcus pneumoniae 

(B) STRAIN: StrR-04 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 

AAAGGAGATG CCTGGCATTA GTATTTCTAC TTCTTGGGAT CGAAAGGTTT 50 
TGGAAACTTC CCTTTCTTCT ATAGTAGGGA GTGTATCCAG TGAAAAAGCT 100 
GGTCTCCCAG CGGAAGAAGC AGAATCCTAT CTTAAAAAAG GCTATTCTCT 150 

3 5 AAATGACCGT GTTGGAACCT CCTATTTGGA AAAGCAATAT GAAGAGACCT 2 00 

TACAAGGAAA ACGCTCGGTA AAAGAAATCC ATCTGGATAA ATATGGCAAT 250 
ATGGAAAGCG TGGACACAAT TGAGGAAGGT AGTAAGGGAA ACAATATCAA 300 
ACTGACCATT GATTTGGCCT TCCAAGATAG CGTGGATGCT TTGCTGAAAA 350 
GTTATTTCAA TTCCGAGCTA GGAAATGGTG GAGCTAAATA TTCTGAAGGT 4 00 

40 GTCTATGCAG TCGCCCTTAA CCCAAAAACA GGTGCTGTTT TATCCATGTC 450 
AGGGATCAAA CATGACCTGA AAACGGGAGA GTTGACTCCT GATTCCTTGG 500 
GAACGGTAAC CAATGTCTTT GTCCCAGGTT CGGTTGTCAA GGCTGCGACC 550 
ATCAGCTCAG GTTGGGAAAA TGGTGTTTTA TCAGGAAACC AAACCTTAAC 60 0 

AGATCAGCCT ATTGTTTTCC AAGGTTCAGC TCCAATTTAT TCTTGGTATA 650 

4 5 AATTGGCATA TGGATCTTTT CCTATTACAG CTGTGGAAGC CTTGGAGTAT 700 

TCATCCAATG CTTACATGGT TCAAACCGCT CTTGGAATCA TGGGCCAGAC 750 
CTATCAACCA AATATGTTTG TTGGAACCAG CAATTTGGAA ACAGCTATGG 800 
GAAAACTTCG TGCGACCTTT GGCGAATATG GCTTGGGGGC TGCGACCGGA 850 
ATTGACCTAC CAGATGAATC TACTGGATTT GTTCCCAAAG AGTATAGCTT 900 

50 TGCTAATTAC ATCACCAATT CCTTTGGGCA GTTTGATAAC TATACGCCCA 950 

TGCAGTTGGC TCAGTATGTA GCAACTATTG CAAATAATGG TGTTCGTGTG 1000 

GCTCCTCGTA TTGTTGAAGG CATTTATGGT AATAATGATA AGGGAGGACT 1050 

GGGTGACTTG ATTCAGCAAC TGCAACCGAC AGAGATGAAT AAGGTCAATA 1100 

TATCCGACTC CGATATGAGC ATCTTGCACC AAGGTTTTTA TCAGGTTGCC 1150 

55 CATGGTACTA GTGGATTGAC AACTGGACGT GCCTTTTCAA ATGGTGCCTT 12 00 

GGTATCCATT AGCGGAAAAA CAGGTACAGC CGAAAGCTAT GTGGCAGATG 12 50 

GTCAGCAAGC AACCAATACC AATGCGGTGG CCTATGCCCC AT CTGATAAT 13 00 

CCCCAAATCG CTGTCGCAGT GGTCTTTCCT CATAATACCA ATCTAACAAA 1350 

TGGTGTAGGA CCTTCCATTG CGCGTGACAT TATCAATCTG TATCAAAAAT 14 00 

60 ACCATCCAAT GAACTAGAAA GGAAATTA 1428 
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2) INFORMATION FOR SEQ ID NO: 1023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 
ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGTATCC 100 
AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 
AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 2 00 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 2 50 

AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 300 
AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 
CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAGCTAAA 4 00 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 450 
TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 50 0 

CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 550 
AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 
CCAAACCTTA ACAGATCAGC CTATTGTTTT CCAAGGTTCA GCTCCAATTT 650 
ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCCTATTAC AGCTGTGGAA 700 
GCCTTGGAGT ATTCATCCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 750 
CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 8 00 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 850 
GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 900 
AGAGTATAGC TTTGCTAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

ACTATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 1000 

GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTGACT TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG ACAACTGGAC GTGCCTTTTC 12 00 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 13 00 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 1350 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 14 00 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



2) INFORMATION FOR SEQ ID NO: 1024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

571 
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(vi) ORIGINAL, SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024 



TTGCCTCTAT 
GATCGAAAGG 

10 CAGTGAAAAA 
AAGGCTATTC 
TATGAAGAGA 
TAAATATGGC 
GAAACAATAT 

15 GCTTTGCTGA 
ATATTCTGAA 
TTTTATCCAT 
CCTGATTCCT 
CAAGGCTGCG 

20 ACCAAACCTT 
TATTCTTGGT 
AGCCTTGGAG 
TCATGGGCCA 
GAAACAGCTA 

25 GGCTGCGACC 
AAGAGTATAG 
AACTATACAC 
TGGTGTTCGT 
ATAAGGGAGG 

30 AATAAGGTCA 
TTATCAGGTT 
CAAATGGTGC 
TATGTGGCAG 
CCCATCTGAT 

35 CCAATCTAAC 
CTGTATCAAA 



TTCAAAGGAG 
TTTTGGAAAC 
GCTGGTCTCC 
TCTAAATGAC 
CCTTACAAGG 
AATATGGAAA 
CAAACTGACC 
AAAGTTATTT 
GGTGTCTATG 
GTCAGGGATC 
TGGGAACGGT 
ACCATCAGCT 
AACAGATCAG 
ATAAATTGGC 
TATTCATCCA 
GACCTATCAA 
TGGGAAAACT 
GGAATTGACC 
CTTTGCTAAT 
CCATGCAGTT 
GTGGCTCCTC 
ACTGGGTGAC 
ATATATCCGA 
GCCCATGGTA 
CTTGGTATCC 
ATGGTCAGCA 
AATCCCCAAA 
AAATGGTGTA 
AATACCATCC 



ATGCCTGGCA 
TTCCCTTTCT 
CAGCGGAAGA 
CGTGTTGGAA 
AAAACGCTCG 
GCGTGGACAC 
ATTGATTTGG 
CAATTCCGAG 
CAGTCGCCCT 
AAACATGACC 
AACCAATGTC 
CAGGTTGGGA 
CCTATTGTTT 
ATATGGATCT 
ATGCTTACAT 
CCAAATATGT 
TCGTGCGACC 
TACCAGATGA 
TACATCACCA 
GGCTCAGTAT 
GTATTGTTGA 
TTGATTCAGC 
CTCCGATATG 
CTAGTGGATT 
ATTAGCGGAA 
AGCAACCAAT 
TCGCTGTCGC 
GGACCTTCCA 
AATGAACTAG 



TTAGTATTTC 
TCTATAGTAG 
AGCAGAATCC 
CCTCCTATTT 
GTAAAAGAAA 
AATTGAGGAA 
CCTTCCAAGA 
CTAGGAAATG 
TAACCCAAAA 
TGAAAACGGG 
TTTGTCCCAG 
AAATGGTGTT 
TCCAAGGTTC 
TTTCCTATTA 
GGTTCAAACC 
TTGTTGGAAC 
TTTGGCGAAT 
ATCTACTGGA 
ATTCCTTTGG 
GTAGCAACTA 
AGGCATTTAT 
AACTGCAACC 
AGCATCTTGC 
GACAACTGGA 
AAACAGGTAC 
ACCAATGCGG 
AGTGGTCTTT 
TTGCGCGTGA 
AAAGGAAATT 



TACTTCTTGG 50 

GGAGTGTATC 100 

TATCTTAAAA 150 

GGAAAAGCAA 2 00 

TCCATCTGGA 2 50 

GGTAGTAAGG 3 00 

TAGCGTGGAT 350 

GTGGAGCTAA 400 

ACAGGTGCTG 450 

AGAGTTGACT 500 

GTTCGGTTGT 550 

TTATCAGGAA 60 0 

AGCTCCAATT 650 

CAGCTGTGGA 700 

GCTCTTGGAA 75 0 

CAGCAATTTG 800 

ATGGCTTGGG 850 

TTTGTTCCCA 900 

GCAGTTTGAT 950 

TTGCAAATAA 10 00 

GGTAATAATG 1050 

GACAGAGATG 110 0 

ACCAAGGTTT 1150 

CGTGCCTTTT 1200 

AGCCGAAAGC 1250 

TGGCCTATGC 13 00 

CCTCATAATA 1350 

CATTATCAAT 140 0 

ATGCT 1445 



40 2) INFORMATION FOR SEQ ID NO: 1025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 10 0 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 200 

60 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 25 0 
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TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCCAA 4 00 

GTATTCTGAG GGTGTGTATG CAGTCGCCCT TAACCCCAAA ACAGGTGCTG 450 

5 TTTTGTCTAT GTCAGGACTC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CTGGGTGGGA AAATGGTGTT TTATCAGGAA 600 

ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

10 AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 8 00 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAG CTTTGCTAAT TACATCACCA ATGCCTTTGG GCAGTTTGAT 950 

15 AACTATACGC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTATCTTGC ACCAAGGATT 1150 

TTACCAAGTA TCGCATGGAA CTAGTCCCCT TACGACAGGA CGGGCGTTTT 12 00 

20 CAGATGGCGC CACTGTTTCT ATCAGTGGTA AGACCGGTAC AGGTGAAAGC 12 50 

TATGTAGCTG GTGGTCAAGA AGCTAATAAT ACCAATGCCG TGGCCTATGC 13 00 

TCCAACAGAA AATCCTCAAA TTGCAGTTGC AGTAGTCTTT CCTCATAATA 1350 

CCAATTTAAC CAAAAATGTT GGGCCAGCAA TTGCTCGCGA CATTATCAAT 14 0 0 

TTATATAACC AACACCATCC AATGAATTAG AAAGGAAGCC A 1441 

25 



2) INFORMATION FOR SEQ ID NO: 1026 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGG AGTGT A . 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 8 00 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 100 0 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

5 TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 110 0 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 12 00 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 13 0 0 

10 GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 13 50 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 14 0 0 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTA 1443 



15 



2) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 144 3 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 
GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 
TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

35 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 20 0 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 
GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 3 50 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 4 00 

4 0 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 
CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 
GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 
AAATCAGACC TTGACAGACC AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

45 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 700 
GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 
TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 
TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 
GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAGCGGA TTGACAACAG GTCGTGCCTT 1200 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 1250 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 13 00 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 14 00 

60 ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAT TTA 1443 
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2) INFORMATION FOR SEQ ID NO: 1028 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1028 

20 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 200 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

25 GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 50 0 

30 CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGC GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

45 GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGC 1446 



50 



2) INFORMATION FOR SEQ ID NO: 1029 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATAGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTTG GGAGTGTATC 10 0 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAAGCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTAGGAA CCTCCTATTT GGAAAAGCAA 200 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGATAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

GCTTTACTGA AAAGTTATTT CAATTCTGAG CTAGAAAATG GTGGAGCCAA 4 00 

GTATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCGG 450 

TTTTGTCTAT GTCAGGGATT AAACATGACT TGAAAACGGG AGAGTTGACG 50 0 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTTCCAG GTTCGGTTGT 550 

CAAGGCGGCG ACCATCAGCT CAGGTTGGGA AAATGGAGTC TTGTCAGGAA 600 

ACCAGACCTT GACAGACCAG TCCATTGTCT TCCAAGGTTC AGCTCCCATC 650 

AATTCTTGGT ATACTCAGGC TTACGGTTCA TTCCCTATCA CAGCGGTCCA 700 

AGCTCTGGAG TATTCATCCA ATGCTTATAT GGTCCAAACA GCCTTAGGTC 750 

TTATGGGGCA GACCTATCAA CCCAATATGT TTGTCGGCAC CAGCAATCTA 80 0 

GAGTCTGCTA TGGGTAAATT GCGTTCAACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACT GGGATTGATC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAA CTTTGCCAAT TTCATTACCA ATGCCTTTGG GCAGTTTGAT 950 

AACTATACCC CAATGCAATT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 100 0 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTGTCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCTCATGGGA CTAGTGGGTT GACAACTGGC CGTGCCTTTT 1200 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACGGGTAC AGCCGAAAGC 1250 

TATGTGGCAG ATGGTCAGGA AGCAACCAAT ACCAATGCGG TGGCCTATGC 13 00 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AAT 1423 



2) INFORMATION FOR SEQ ID NO: 1030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1447 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1030 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 
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10 



15 



20 



25 



GATAAATATG 
GGGAAACAAT 
ATGCTTTGCT 
AAATATTCTG 
TGTTTTATCC 
CTCCTGATTC 
GTCAAGGCTG 
AAACCAAACC 
TTTATTCTTG 
GAAGCCTTGG 
AATCATGGGC 
TGGAAACAGC 
GGGGCTGCGA 
CAAAGAGTAT 
ATAACTATAC 
AATGGTGTTC 
TGATAAGGGA 
TGAATAAGGT 
TTTTATCAGG 
TTCAAATGGT 
GCTATGTGGC 
GCCCCATCTG 
TACCAATCTA 
ATCTGTATCA 



GCAATATGGA 
ATCAAACTGA 
GAAAAGTTAT 
AAGGTGTCTA 
ATGTCAGGGA 
CTTGGGAACG 
CGACCATCAG 
TTAACAGATC 
GTATAAATTG 
AGTATTCATC 
CAGACCTATC 
TATGGGAAAA 
CCGGAATTGA 
AGCTTTGCTA 
GCCCATGCAG 
GTGTGGCTCC 
GGACTGGGTG 
CAATATATCC 
TTGCCCATGG 
GCCTTGGTAT 
AGATGGTCAG 
ATAATCCCCA 
ACAAATGGTG 
AAAATACCAT 



AAGCGTGGAC 
CCATTGATTT 
TTCAATTCCG 
TGCAGTCGCC 
TCAAACATGA 
GTAACCAATG 
CTCAGGTTGG 
AGCCTATTGT 
GCATATGGAT 
CAATGCTTAC 
AACCAAATAT 
CTTCGTGCGA 
CCTACCAGAT 
ATTACATCAC 
TTGGCTCAGT 
TCGTATTGTT 
ACTTGATTCA 
GACTCCGATA 
TACTAGTGGA 
CCATTAGCGG 
CAAGCAACCA 
AATCGCTGTC 
TAGGAC CTTC 
CCAATGAACT 



ACAATTGAGG 
GGCCTTCCAA 
AGCTAGGAAA 
CTTAACCCAA 
CCTGAAAACG 
TCTTTGTCCC 
GAAAATGGTG 
TTTCCAAGGT 
CTTTTCCTAT 
ATGGTTCAAA 
GTTTGTTGGA 
CCTTTGGCGA 
GAATCTACTG 
CAATTCCTTT 
ATGTAGCAAC 
GAAGGCATTT 
GCAACTGCAA 
TGAGCATCTT 
TTGACAACTG 
AAAAACAGGT 
ATACCAATGC 
GCAGTGGTCT 
CATTGCGCGT 
AGAAAGGAAA 



AAGGTAGTAA 3 00 

GATAGCGTGG 350 

TGGTGGAGCT 400 

AAACAGGTGC 450 

GGAGAGTTGA 500 

AGGTTCGGTT 550 

TTTTATCAGG 600 

TCAGCTCCAA 650 

TACAGCTGTG 700 

CCGCTCTTGG 750 

ACCAGCAATT 800 

ATATGGCTTG 850 

GATTTGTTCC 900 

GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGTAATAA 1050 

CCGACAGAGA 1100 

GCACCAAGGT 1150 

GACGTGCCTT 1200 

ACAGCCGAAA 1250 

GGTGGCCTAT 1300 

TTCCTCATAA 1350 

GACATTATCA 1400 

TTATGCT 1447 



2) INFORMATION FOR SEQ ID NO: 1031 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 6 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1031 



ATTGCCTCTA 

45 GGATAGAAAG 
CCAGTGAAAA 
AAAGGCTATT 
ATACGAGGAA 
ATAAATATGG 

50 GGAAACAATA 
TGCTTTGCTG 
AGTATTCTGA 
GTTTTGTCTA 
GCCTGATTCC 

55 TTAAGGCCGC 
AACCAAACCT 
TTATTCTTGG 
AAGCCTTGGA 
ATCATGGGCC 

60 GGAAACAGCT 



TTTCAAAGGA 
GTTTTGGAAA 
AGCTGGTCTC 
CTCTAAATGA 
ACCTTACAAG 
CAATATGGAA 
TCAAACTGAC 
AAAAGTTATT 
AGGCGTGTAT 
TGTCAGGACT 
TTGGGAACGG 
TACCATCAGC 
TAACAGATCA 
TATAAATTGG 
GTATTCATCT 
AGACCTATCA 
ATGGGAAAAC 



GATGCCTGGC 
CTTCTCTTTC 
CCAGCGGAAG 
CCGTGTTGGG 
GAAAACGCTC 
AGCGTGGATA 
CATTGATTTG 
TCAATTCCGA 
GCAGTCGCCC 
CAAACATGAC 
TAACCAATGT 
TCAGGTTGGG 
GCCTATTGTT 
CATATGGATC 
AATGCTTACA 
ACCAAATATG 
TTCGTGCGAC 



ATTAGTATTT 
TTCTATAGTA 
AAGCAGAAGC 
ACTTCTTACC 
GGTAAAAGAA 
CAATTGAGGA 
GCCTTCCAAG 
GCTAGGAAAT 
TTAACCCCAA 
CTGAAAACGG 
CTTTGTCCCA 
AAAATGGTGT 
TTCCAAGGTT 
TTTTCCTATT 
TGGTTCAAAC 
TTTGTTGGAA 
CTTTGGCGAA 



CTACCTCTTG 


50 


GGGAGTGTAT 


100 


CTATCTTAAA 


150 


TGGAAAAACA 


200 


ATCCATCTGG 


250 


AGGTAGTAAG 


300 


ATAGCGTGGA 


350 


GGTGGAGCTA 


400 


AAGAGGTGCT 


450 


GAGACTTGAC 


500 


GGGTCAGTAG 


550 


TTTATCAGGA 


600 


CAGCTCCAAT 


650 


ACAGCTGTGG 


700 


CGCTCTTGGA 


750 


CCAGCAATTT 


800 


TATGGCTTGG 


850 
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ATGGTGTTCG TGTGGCTCCT CGTATTGTTG 
5 GATAAGGGAG GACTGGGTGA CTTGATTCAG 
GAATAAGGTC AATATATCCG ACTCCGATAT 
TTTATCAGGT TGCCCATGGT ACTAGTGGAT 
TCAAATGGCG CCTTGGTATC CATTAGCGGA 
CTATGTGGCA GATGGTCAGC AAGCAACCAA 
10 CCCCATCTGA TAATCCCCAA ATCGCTGTTG 
ACCAATCTAA CAAATGGTGT AGGACCTTCC 
TCTGTATCAA AAATACCATC CAATGAACTA 



PCT/CA00/01150 

AATCTACTGG ATTTGTTCCC 900 

AATGCCTTTG GGCAGTTTGA 950 

TGTAGCAACT ATTGCAAATG 1000 

AAGGCATTTA TGGTAATAAT 1050 

CAACTGCAAC CGACAGAGAT 1100 
GAGCATCTTG CACCAAGGTT . 1150 

TGACAACTGG ACGTGCCTTT 1200 

AAAACAGGTA CAGCCGAAAG 1250 

TACCAATGCG GTGGC CTATG 1300 

CAGTGGTCTT TCCTCATAAT 1350 

ATTGCGCGTG ACATTATCAA 1400 

GAAAGGAAAT TATGCT 1446 



15 



2) INFORMATION FOR SEQ ID NO: 1032 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1446 bases 
20 (B) TYPE: Nucleic acid 

(C> STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 

GGATAGAAAG GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

35 AAAGGCTATT CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 2 00 

ATATGAAGAG ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGTGTGGATA CAATTGAGGA AGGTAGTAAG 300 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 4 00 

40 AGTATTCTGA GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 

TCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGTTCGGTTG 550 

TCAAGGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 

AACCAGACCT TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

45 CAATTCTTGG TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 7 00 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 

CTTATGGGGC AGACCTACCA ACCTAATATG TTTGTCGGCA CCAGCAACCT 800 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 

GTTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

50 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 12 00 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 1250 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 1300 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 1350 

ACCAACCTTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 1400 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 1446 
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2) INFORMATION FOR SEQ ID NO: 1033 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 

2 0 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

2 5 GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAGCTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCC 400 

AAGTATTCTG AGGGTGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

3 0 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCTCATGG GACTAGTGGA TTGACAACTG GACGTGCTTT 12 00 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

45 GTTATGTTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 13 00 

GCCCCATCAG ATAATCCTCA AATCGCTGTT GCCGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 1400 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



50 



2) INFORMATION FOR SEQ ID NO: 1034 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

579 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 

GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 
GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 

10 CATGAAAATG AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 
GAGTTCCTTG AACAGTATTC TTGCAGGGAC AGACGGTATT ATTAC CTATG 
AAAAAGACCG TGTAGGAAAT ATCGTACCAG GTACAGAACT GGTATCGCAA 
CAAACTGTGG ATGGCAAGGA TGTTTATACA ACATTGTCTA GTCCGCTACA 
ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA AGACCGGTGA AATCCTCGCT 
ACCACCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 
GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 
CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 
CCAAGTGGAG AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 

20 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 
CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 
GAACAAAAAA TGGGAGATGC TACTTGGTTG GATTATCTAA AACGCTTTAA 
ATTTGGGGTT CCAACTCGCT TTGGCTTGAC AGATGAATAC GCTGGTCAAC 
TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGGCAAGGA 

25 . ATTTCAGTGA CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 
TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 
ACAATCAGTC TGTACGTAAG TCACAAAAAG AAATAGTAGG AAATCCTGTT 
TCCAAAGAGG CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 
GGACCCTCTA TATGGAACTA TGTATAATCA CTACACAGGA AAGCCAATTA 

30 TAACAGTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 
GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 
CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTGTATG 
TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 
GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 

35 TCAATCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 
CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA 
CGCCGAAATA TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 
AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 
TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 

40 AAAGAGACTG CCGAGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 
TGAAGGTTCA GGTTCCGTTG 



45 2) INFORMATION FOR SEQ ID NO: 1035 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
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1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1670 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) 



MOLECULE TYPE: Genomic DNA 



55 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1035 



60 



580 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA CAACCAGTCC 50 
CAATCGTAGT TACCCAAATG GACAATTTGC TTCTAGTTTT ATCGGTCTAG 100 
CTCAGCTCCA TGAAAATGAA GATGGAAGCA AGAGTTTGCT GGGAACCTCT 150 
GGAATGGAGA GTTCCTTGAA CAGTATTCTT GCAGGGACAG ACGGCATTAT 200 
5 TACCTATGAA AAGGATCGTC TGGGCAATAT TGTACCCGGA ACAGAACAAG 250 
TTTCCCAACA AACGGTGGAT GGCAAGGATG TTTACACAAC CATTTCCAGC 3 00 

CCCCTCCAGT CCTTCATGGA AACTCAGATG GATGCCTTTC TAGAAAAAGT 350 
AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG ACCGGTGAAA 400 
TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC TAAAGAAGGA 450 

10 ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA GTAACTATGA 500 
ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT ATTGATAATA 550 
ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT CAAAATAGCG 600 
GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA CTACTGGTGG 650 
GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT GTTGGAATGA 700 

15 GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA TTATCTAAAA 750 
CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG ATGAATACGC 800 
TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA AGCTCATTTG 850 
GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC CTTTACAGCT 900 
ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA GTGCTATTTA 950 

20 TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA ATAGTAGGAA 1000 

ATCCTGTTTC CAAAGAGGCA GCAAGCACAA CTCGAAATCA CATGATCTTA 1050 

GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT ACACAGGAAA 1100 

GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA TCCGGTACGG 1150 

CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG TTCTACCAAT 12 00 

25 TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC CTGATTTTAT 1250 

CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT ATCCAGTTGG 13 00 

GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT GAAAGAATCT 1350 

CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA CGACAGAATC 14 00 

TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT GAGTTGGCGG 14 50 

30 AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG TACTGGAACA 1500 

AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG CACCAAACCA 1550 

ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA GACATGTATG 1600 

GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT GGATATTGAA 1650 

CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAG 1683 

35 



2 ) INFORMATION FOR SEQ ID NO : 1036 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036 



TCAAAAAAGA 
55 AGT CCCAATC 
TCTAGCTCAG 
CCTCTGGAAT 
ATTATTACCT 
ACAAGTTTCC 
60 CCAGCCCCCT 



ATTGGAAACT 
GTAGTTACCC 
CTCCATGAAA 
GGAGAGTTCC 
ATGAAAAGGA 
CAACAAACGG 
CCAGTCCTTC 



GCAGAGGTCA AGGGGATTGA 
AAATGGACAA TTTGCTTCTA 
ATGAAGATGG AAGCAAGAGT 
TTGAACAGTA TTCTTGCAGG 
TCGTCTGGGC AATATTGTAC 
TGGATGGCAA GGATGTTTAC 
ATGGAAACTC AGATGGATGC 

581 



TTTTACAACC 50 

GTTTTATCGG 10 0 

TTGCTGGGAA 150 

GACAGACGGC 200 

CCGGAACAGA 250 

ACAACCATTT 300 

CTTTCTAGAA 350 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AAAGTAAAAG 
TGAAATCCTC 
AAGGAATCAC 
TATGAACCAG 
5 TAATAATACC 
TAGCGGATGC 
GGTGGGATGA 
AATGAGTCTA 
TAAAACGCTT 

10 TACGCTGGTC 
ATTTGGGCAA 
CAGCTATTGC 
ATTTATGATA 
AGGAAATCCT 

15 TCTTAGTTGG 
GGAAAGCCAA 
TACGGCTCAA 
CCAATTATAT 
TTTATCTTGT 

2 0 GTTGGGAGAA 
AATCTCTCAA 
GAATCTTCTT 
GGCGGAAGCC 
GAACAAAGAT 

25 AACCAACAAG 
GTATGGCTGG 
TTGAACTGGA 



GTAAGTATAT 

GCTACCACCC 

TGAGGACTTT- 

GATCAGCCAT 

TTCCCAAGTG 

GACGACTCGA 

TGACTTTCTT 

CTTGAACAAA 

TAAATTTGGG 

AACTTCCAGC 

GGAATTTCAG 

TAATGATGGA 

CTAACAATCA 

GTTTCCAAAG 

GACGGACCCT 

TTATAACAGT 

ATCGCTGATG 

TTTCTCAGTT 

ATGTAACGGT 

TTTGCCACCC 

TCTTCAATCT 

ATGCAATGCC 

TTACGCCGAA 

TAAAGAGACT 

TTCTCCTTTT 

AAAAAAGAGA 

ATTTGAAGGT 



GACCGCGACC 

AACGACC TAC 

GTTTGGCGTG 

GAAGGTTATG 

GAGAATACTT 

GATTGGGATG 

ACAAGGTTTC 

AAATGGGAGA 

GTTCCAACTC 

TGATAATATT 

TGACACAAAC 

GTTATGCTGG 

GTCTGTACGT 

AGGCAGCAAG 

CTATATGGAA 

TCCTGGACAA 

AGAAAAATGG 

GTGACTATGA 

TCAACAGCCT 

CAATCTTGGA 

CCAGCCAAAA 

TAGCATCAAG 

ATATTGTGCA 

TCTGTAGAAG 

ATCGGATAAG 

CTGCCGAGAC 

TCAGGTTCCG 



TTGGTCAGTG 

CTTTAATGCA 

ATATTCTTTA 

ACGTTAGCTT 

CAATAGCAGT 

TTAATGATGG 

GCTCACTCCA 

TGCTACTTGG 

GCTTTGGCTT 

GTTAGTATTG 

ACAAATGCTT 

AGCCAAAATT 

AAGTCACAAA 

CACAACTCGA 

CTATGTATAA 

AATGTAGCAG 

AGGATACTTG 

ATCCTGCTGA 

GAGCATTATT 

GCGGGCTTCA 

ATTTAGATAA 

GATATTTCAC 

ACCAATCGTT 

AAGGGACCAA 

GTAGAAGAAA 

CTTTGCTAAA 

TT 



PCT/CA00/01150 

CAAAGACCGG 400 

GATACTAAAG 450 

TCAAAGTAAC 5 00 

CTTCTATTGA 550 

GAATTCAAAA 60 0 

TTTGACTACT 650 

GTAATGTTGG 700 

TTGGATTATC 750 

GACAGATGAA 8 00 

CTCAAAGCTC 850 

CGTGCCTTTA 900 

TATAAGTGCT 950 

AAGAAATAGT 1000 

AATCACATGA 1050 

TCACTACACA 110 0 

TTAAATCCGG 1150 

GTTGGTTCTA 1200 

AAATCCTGAT 1250 

CAGGTATCCA 1300 

GCTATGAAAG 1350 

AGTTACGACA 1400 

CTGGTGAGTT 14 50 

GTAGGTACTG 1500 

TCTTGCACCA 1550 

TTCCAGACAT 1600 

TGGTTGGATA 1650 
1682 



30 



40 



2) INFORMATION FOR SEQ ID NO: 103 7 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1681 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037 

CAAAAAAGAA TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAATCG TAGTTATCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TACTGGGAAC 150 

50 CTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGTA 2 00 

TTATTACCTA TGAAAAAGAC CGTGTAGGAA ATATCGTACC AGGTACAGAA 250 

CTGGTATCGC AACAAACTGT GGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 3 50 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

55 GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

60 GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

582 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ATGAGTCTAC 
AAAACGCTTT 
ACGCTGGTCA 
TTTGGGCAAG 
5 AGCTATTGCT 
TTTATGATAC 
GGAAATCCTG 
CTTAGTTGGG 
GAAAGCCAAT 

10 ACGGCTCAAA 
CAATTATATT 
TTATCTTGTA 
TTGGGAGAAT 
ATCTCTCAAT 

15 AATCTTCTTA 
GCGGAAGCCT 
AACAAAGATT 
ACCAACAAGT 
TATGGCTGGA 

2 0 TGAACTGGAA 



TTGAACAAAA 
AAATTTGGGG 
ACTTCCAGCT 
GAATTTCAGT 
AATGATGGAG 
TAACAATCAG 
TTTCCAAAGA 
ACGGACCCTC 
TATAACAGTT 
TCGCTGATGA 
TTCTCAGTTG 
TGTAACGGTT 
TTGCCACCCC 
CTTCAATCTC 
TGCAATGCCT 
TACGCCGAAA 
AAAGAGACTT 
TCTCCTTTTA 
AAAAAGAGAC 
TTTGAAGGTT 



AATGGGAGAT 
TTCCAACTCG 
GATAATATTG 
GACACAAACA 
TTATGCTGGA 
TCTGTACGTA 
GGCAGCAAGC 
TATATGGAAC 
CCTGGACAAA 
GAAAAATGGA 
TGACTATGAA 
CAACAGCCTG 
AATCTTGGAG 
CAGCCAAAAA 
AGCATCAAGG 
TATTGTGCAA 
CTGTAGAAGA 
TCGGATAAGG 
TGCCGAGACC 
CAGGTTCCGT 



GCTACTTGGT 
CTTTGGCTTG 
TTAGTATTGC 
CAAATGCTTC 
GCCAAAATTT 
AGTCACAAAA 
ACAACTCGAA 
TATGTATAAT 
ATGTAGCAGT 
GGATACTTGG 
TCCTGCTGAA 
AGCATTATTC 
CGGGCTTCAG 
TTTAGATAAA 
ATATTTCACC 
CCAATCGTTG 
AGGGACCAAT 
TAGAAGAAAT 
TTTGCTAAAT 
T 



TGGATTATCT 750 

ACAGATGAAT 800 

TCAAAGCTCA 850 

GTGCCTTTAC 900 

ATAAGTGCTA 950 

AGAAATAGTA 1000 

ATCACATGAT 1050 

CACTACACAG 1100 

TAAATCCGGT 1150 

TTGGTTCTAC 12 00 

AATCCTGATT 12 50 

AGGTATCCAG 13 00 

CTATGAAAGA 1350 

GTTACGACAG 14 00 

TGGTGAGTTG 1450 

TAGGTACTGG 1500 

CTTGCACCAA 1550 

TCCAGACATG 1600 

GGTTGGATAT 1650 
1681 



25 



2) INFORMATION FOR SEQ ID NO: 1038 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038 

40 AAAAAAGAAT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTATCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT ACTGGGAACC 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGTAT 2 00 

TATTACCTAT GAAAAAGACC GTGTAGGAAA TATCGTACCA GGTACAGAAC 2 50 

45 TGGTATCGCA ACAAACTGTG GATGGCAAGG ATGTTTATAC AACATTGTCT 3 00 

AGTCCGCTAC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 3 50 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

50 TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 6 00 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

55 AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 80 0 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

60 GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

583 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

5 TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 1300 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 1400 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG 1450 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

10- ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAGAGACT GCCGAGACCT TTGCTAAATG GTTGGATATT 1650 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCA 1685 



15 



30 



2 ) INFORMATION FOR SEQ ID NO: 1039 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 167 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 9 



AAAAGAATTG GAAACTGCAG AGGTCAAGGG GATTGATTTT AC AAC CAGTC 50 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGACTA 100 

35 GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTAC TGGGAACCTC 150 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGTATTA 200 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 2 50 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA C ATTGT C TAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

40 TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

45 GGATGCGACG ACTCGAGATT GGGATGTTAA TGATGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 8 00 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

50 GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

55 AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 12 00 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 13 00 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

60 TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1400 

584 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
ACTGGAATTT GAAGGTTCAG GTTCCGTTG 



PCT/CA00/01150 

TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2) INFORMATION FOR SEQ ID NO: 1040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040 



TAAAAAGGAT CTAAAAGACG CTAGTGTTGA AGGAATTGAC TTCACAACTA 50 

GCCCTAATAG AAGCTATCCA AATGGACAAT TCGCTTCTAG TTTTATTGGT 10 0 

TTGGCCCAAC TCCATGAAAA TGAGGATGGT AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGGATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

30 TTATTAC CTA TGAAAAAGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

35 AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGAAGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

40 ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

45 TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

50 CAATTATATT TTCTCAGCTG TGACTATGAA TCCTGCTGAA AATCCTGATT 12 50 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

55 GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT CGTTC 1685 



60 



585 
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2) INFORMATION FOR SEQ ID NO: 1041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

' (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 <B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 

GTCTATCAAA AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA 50 

20 CAACCAGTCC CAATCGTAGT TATCCAAACG GACAATTTGC TTCTAGTTTT 100 

ATCGGACTAG CTCAGCTCCA TGAAAATGAA GATGGCAGCA AGAGCTTACT 150 

GGGAACCTCT GGAATGGAGA GTTCCTTGAA CAGTATTCTT - GCAGGGACAG 200 

ACGGTATTAT TACCTATGAA AAAGACCGTG TAGGAAATAT CGTACCAGGT 250 

ACAGAACTGG TATCGCAACA AACTGTGGAT GGCAAGGATG TTTATACAAC 300 

25. ATTGTCTAGT CCGCTACAAT CTTTCATGGA AACTCAGATG GATGCCTTTC 350 

TAGAAAAAGT AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG 4 00 

ACCGGTGAAA TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC 450 

TAAAGAAGGA ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA 500 

GTAACTATGA ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT 550 

30 ATTGATAATA ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT 60 0 

CAAAATAGCG GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA 650 

CTACTGGTGG GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT 700 

GTTGGAATGA GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA 750 

TTATCTAAAA CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG 800 

35 ATGAATACGC TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA 850 

AGCTCATTTG GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC 900 

CTTTACAGCT ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA 950 

GTGCTATTTA TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA 10 00 

ATAGTAGGAA ATCCTGTTTC .CAAAGAGGCA GCAAGCACAA CTCGAAATCA 1050 

40 CATGATCTTA GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT 1100 

ACACAGGAAA GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA 1150 

TCCGGTACGG CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG 1200 

TTCTACCAAT TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC 1250 

CTGATTTTAT CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT 1300 

45 ATCCAGTTGG GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT 1350 

GAAAGAATCT CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA 1400 

CGACAGAATC TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT 1450 

GAGTTGGCGG AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG 1500 

TACTGGAACA AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG 1550 

50 CACCAAACCA ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA 1600 

GACATGTATG GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT 1650 

GGATATTGAA CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAGAAG 1696 



55 



2) INFORMATION FOR SEQ ID NO: 1042 



. (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1689 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT GCTGGGAACC 150 

15 TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGCAT 2 00 

TATTACCTAT GAAAAGGATC GTCTGGGAAA TATTGTCCCC GGAACGGAAC 250 

AAGTTTCCCA ACAAACTGTA GATGGCAAGG ATGTTTATAC GACTATTTCC 300 

AGCACCCTTC AGTCCTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

20 AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTAT TGATA 550 
ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA ■ 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 65 0 

2 5 TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 80 0 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

30 GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 95 0 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

35 CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 12 50 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 13 00 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAC 1350 

TCTCTCAATC TTCAAACAAC AGCTAAAGCT TTGGAGCAAG TAAGTCAACA 14 0 0 

4 0 AAGTCCTTAT CCTATGCCTA GTGTCAAGGA TATTTCACCT GGTGATTTAG 1450 

CAGAAGAATT GCGTCGCAAT CTTGTACAAC CCATCGTTGT GGGAACAGGA 1500 

ACGAAGATTA AAAACAGTTC TGCTGAAGAA GGGAAGAATC TTGCCCCGAA 1550 

TCAGCAAGTC CTTATCTTAT CTGATAAAGT AGAGGAAGTT CCAGATATGT 1600 

ATGGTTGGAC AAAGGAGACT GCTGAGACCC TTGCTAAGTG GCTCAATATA 1650 

4 5 GAACTTGAAT TTCAAGGTTC GGGTTCTACT GTGCAGAAG 1689 



50 



2) INFORMATION FOR SEQ ID NO: 1043 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus pneumoniae 

587 
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(B) STRAIN: StrR-10 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043 

5 CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 

CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

10 CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

15 ATGAACCAGG AT CAGC CATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 55 0 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 65 0 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 70 0 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

20 AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 80 0 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

25 GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

3 0 TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 130 0 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

35 AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1690 



40 



2) INFORMATION FOR SEQ ID NO: 1044 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 



AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTCCTAAT 50 

CGTAGTTACC CAAACGGACA ATTTGCTTCT AGTTTTATCG GACTAGCTCA 100 

60 GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 
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TGGAGAGTTC 
TATGAAAAGG 
CCAACAAACG 
TTCAGTCCTT 
5 GGCAAGTATA 
TGCAACAACG 
CCAAGGACTT 
GGGTCAACCA 
TTTCCCAGGA 

10 CGACAATTCG 
ATGACTTTCT 
ACTTGAACAA 
TTAAGTTTGG 
CAATTGCCTG 

15 GGGGATTTCA 
CTAATGATGG 
CCAAATGATC 
TGTTTCTAAA 
GGACGGATCC 

20 ACTGTAACTG 
GATTGCTGAC 
TTTTCTCGGC 
TATGTAACGG 
ATTTGCCACC 

25 ATCTTCAATC 
TATGCAATGC 
CTTACGCCGA 
TTAAAGAGAC 
GTTCTCCTTT 

30 GAAAAAAGAG 
AATTTGAAGG 



TTTAAATAGA 
ATCGTCTGGG 
GTAGATGGCA 
CATGGAGACC 
TGACGGCTAC 
CAACGGCCGA 
TGTTTGGCGT 
TGAAGGTCAT 
GGAGAAGTTT 
AGATTGGGAT 
TACAAGGTTT 
AAAATGGGAG 
GGTGCCGACG 
CAGATAATAT 
GTGACCCAGA 
TGTTATGCTG 
AAACTGCTCG 
GATGCAGCTA 
GGTTTATGGA 
TTCCTGGGCA 
GAGAAAAATG 
TGTATCGATG 
TACAGCAACC 
CCAATCTTGG 
TCCAGCCAAG 
CTAGCATCAA 
AATATTGTGC 
TTCTGTAGAA 
TATCGGATAA 
ACTGCTGAAA 
GTCAGGTT 



ATTCTTGCAG 
AAATATTGTC 
AGGATGTTTA 
CAGATGAATG 
CTTGGTCAGT 
CCTTCGATGC 
GATATCCTCT 
GACGCTTGCT 
TCAATAGTAG 
GTTAATGATG 
CGCTCACTCC 
ATGCTACTTG 
CGTTTTGGTC 
TGTTAATATT 
CGCAAATGAT 
GAGCCTAAAT 
GAAATCTCAA 
GTCTAACTCG 
ACCATGTATA 
AAATGTAGCC 
GTGGTTATCT 
AATCCGGCTG 
TGAACATTAT 
AGCGGGCTTC 
AATTTAGATA 
GGATATTTCA 
AACCAATCGT 
GAAGGGACCA 
GGTAGAAGAA 
CCTTTGCTAA 



GGACAGACGG 
CCCGGAACGG 
TACGACTATT 
CCTTTCAAGA 
GCTAAAACAG 
TGATACTAAG 
ATCAAAGTAA 
GCTGCTATTG 
TGAATTAAAA 
GTTTGACGAC 
AGTAATGTTG 
GTTGGATTAT 
TGACTGATGA 
GCCATGAGTG 
TCGTGCCTTT 
TTATTAGTGC 
AAAGAAATTG 
GACTAACATG 
ACCACAGCAC 
CTCAAGTCTG 
AGTCGGGTTA 
AAAATCCTGA 
TCAGGTATCC 
AGCTATGAAA 
AAGTTACGAC 
CCTGGTGAGT 
TGTAGGTACT 
ATCTTGCACC 
ATTCCAGACA 
ATGGTTGGAT 



CATTATTACC 2 00 

AACAAGTTTC 250 

TCCAGCACCC 3 00 

AAAAGTAAAA 350 

GGGAAATTCT 4 00 

GAAGGACTTA 450 

CTATGAGCCA 500 

ATAATAACAC 550 

ATAGCGGATG 600 

TGGTGGGATG 650 

GAATGAGTCT 700 

CTAAACCGCT 750 

GTATTCAGGT 800 

CATTTGGTCA 850 

ACAGCTATTG 900 

CATTTATGAT 950 

TGGGAAATCC 1000 

ATTTTGGTAG 1050 

AGGTAAGCCA 1100 

GTACGGCTCA 1150 

ACCAACTATA 1200 

TTTTATCTTG 1250 

AGTTGGGAGA 13 00 

GAATCTCTCA 13 50 

AGAATCTTCT 1400 

TGGCGGAAGC 1450 

GGAACAAAGA 1500 

AAACCAACAA 1550 

TGTATGGCTG 1600 

ATTGAGCTGG 1650 
1668 



35 2) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGCTTCTAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 

GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA TTATTACCTA 200 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 250 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 3 00 

CAGTCCTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA AAGTAAAAGG 350 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 400 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 450 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 500 

589 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ATCAGCCATG 
TCCCAAGTGG 
ACGACTCGAG 
GACTTTCTTA 
5 TTGAACAAAA 
AAATTTGGGG 
ACTTCCAGCT 
GAATTTCAGT 
AATGATGGAG 

10 TAACAATCAG 
TTTCCAAAGA 
ACGGACCCTC 
TATAACAGTT 
TCGCTGATGA 

15 TTCTCAGTTG 
TGTAACGGTT 
TTGCCACCCC 
CTTCAATCTC 
TGCAATGCCT 

20 TACGCCGAAA 
AAAGAGACTT 
TCTCCTTTTA 
AAAAAGAGAC 
TTTGAAGGTT 

25 



AAGGTTATGA 
AGAATACTTC 
ATTGGGATGT 
CAAGGTTTCG 
AATGGGAGAT 
TTCCAACTCG 
GATAATATTG 
GACACAAACA 
TTATGCTGGA 
TCTGTACGTA 
GGCAGCAAGC 
TATATGGAAC 
CCTGGACAAA 
GAAAAATGGA 
TGACTATGAA 
CAACAGCCTG 
AATCTTGGAG 
CAGCCAAAAA 
AGCATCAAGG 
TATTGTGCAA 
CTGTAGAAGA 
TCGGATAAGG 
TGCCGAGACC 
CAGGTTCCGT 



CGTTAGCTTC 
AATAGCAGTG 
TAATGATGGT 
CTCACTCCAG 
GCTACTTGGT 
CTTTGGCTTG 
TTAGTATTGC 
CAAATGCTTC 
GCCAAAATTT 
AGTCACAAAA 
ACAACTCGAA 
TATGTATAAT 
ATGTAGCAGT 
GGATACTTGG 
TCCTGCTGAA 
AGCATTATTC 
CGGGCTTCAG 
TTTAGATAAA 
ATATTTCACC 
CCAATCGTTG 
AGGGACCAAT 
TAGAAGAAAT 
TTTGCTAAAT 
TGTTCAGAAG 



TTCTATTGAT 
AATTCAAAAT 
TTGACTACTG 
TAATGTTGGA 
TGGATTATCT 
ACAGATGAAT 
TCAAAGCTCA 
GTGCCTTTAC 
ATAAGTGCTA 
AGAAATAGTA 
ATCACATGAT 
CACTACACAG 
TAAATCCGGT 
TTGGTTCTAC 
AATCCTGATT 
AGGTATCCAG 
CTATGAAAGA 
GTTACGACAG 
TGGTGAGTTG 
TAGGTACTGG 
CTTGCACCAA 
TCCAGACATG 
GGTTGGATAT 



AATAATACCT 550 

AGCGGATGCG 600 

GTGGGATGAT 650 

ATGAGTCTAC 700 

AAAACGCTTT 750 

ACGCTGGTCA 800 

TTTGGGCAAG 850 

AGCTATTGCT 900 

TTTATGATAC 950 

GGAAATCCTG 10 00 

CTTAGTTGGG 1050 

GAAAGCCAAT 1100 

GCGGCTCAAA 1150 

CAATTATATT 12 00 

TTATCTTGTA 1250 

TTGGGAGAAT 13 00 

ATCTCTCAAT 1350 

AATCTTCTTA 1400 

GCGGAAGCCT 1450 

AACAAAGATT 1500 

ACCAACAAGT 1550 

TATGGCTGGA 160 0 

TGAAC TGGAA 1650 
1680 



2 ) INFORMATION FOR SEQ ID NO: 1046 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1046 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

45 TCCTAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGTC 100 

TAGCTCAACT CCATGAAAAT GAAGATGGAA GCAAGAGTTT GCTGGGAACT 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA AAGACGGTAT 200 

TATTACTTAT GAAAAGGATC GTCTGGGTAA TATTGTCCCT GGAACAGAAC 250 

AAGTTTCCCA ACAAACGGTA GATGG CAAGG ATGTTTATAC AACATTGTCT 300 

50 AGTCCGCTCC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 35 0 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 50 0 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

55 ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

60 CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

590 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TTGGGCAAGG AATTTCAGTG ACACAAACAC 
GCTATTGCTA ATGATGGAGT TATGCTGGAG 
TTATGATACT AACAATCAGT CTGTACGTAA 
GAAATCCTGT TTCCAAAGAG GCAGCAAGCA 
TTAGTTGGGA CGGACCCTCT ATATGGAACT 
AAAGCCAATT ATAACAGTTC CTGGACAAAA 
CGGCTCAAAT CGCTGATGAG AAAAATGGAG 
AATTATATTT TCTCAGTTGT GACTATGAAT 
TATCTTGTAT GTAACGGTTC AACAGCCTGA 
TGGGAGAATT TGCCACCCCA ATCTTGGAGC 
TCTCTCAATC TTCAATCTCC AGCCAAAAAT 
ATCTTCTTAT GCAATGCCTA GCATCAAGGA 
CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC 
ACAAAGATTA AAGAGACTTC TGTAGAAGAA 
CCAACAAGTT CTCCTTTTAT CGGATAAGGT 
ATGGCTGGAA AAAAGAGACT GCCGAGACCT 
GAACTGGAAT TTGAAGGTTC AGGTTCCGTT 



AAATGCTTCG TGCCTTTACA 900 

CCAAAATTTA TAAGTGCTAT 950 

GTCACAAAAA GAAATAGTAG 1000 

CAACTCGAAA TCACATGATC 1050 

ATGTATAATC ACTACACAGG 1100 

TGTAGCAGTT AAATCCGGTA 1150 

GATACTTGGT TGGTTCTACC 1200 

CCTGCTGAAA ATCCTGATTT 1250 

GCATTATTCA GGTATCCAGT 13 00 

GGGCTTCAGC TATGAAAGAA 1350 

TTAGATAAAG TTACGACAGA 1400 

TATTTCACCT GGTGAGTTGG 1450 

CAATCGTTGT AGGTACTGGA 1500 

GGGACCAATC TTGCACCAAA 1550 

AGAAGAAATT CCAGACATGT 1600 

TTGCTAAATG GTTGGATATT 1650 

GTTCAGAAG 1689 



2) INFORMATION FOR SEQ ID NO: 1047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 

CAAAAAAGAA TTGGAAGCTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAACCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA 200 

TTATTACCTA TGAAAAGGAT CGTCTGGGTA ATATTGTACC CGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTACA CAACCATTTC 3 00 

CAGCCCCCTC CAGTCCTTTA TGGAAACCCA GATGGATGCT TTTCAAGAGA 3 50 

AGGTAAAAGG AAAGTACACG ACAGCGACTT TGGTCAGTGC TAAAACAGGG 400 

GAAATTCTGG CAACAACGCA ACGACCGACC TTTGATGCAG ATACAAAAGA 450 

AGGCATTACA GAGGACTTTG TTTGGCGTGA TATCCTTTAC CAAAGTAACT 500 

ATGAGCCAGG TTCCCCTATG AAAGTGATGA TGTTGGCTGC TGCTATTGAT 550 

AATAATACCT TTCCAGGGGG AGAAGTCTTC AATAGTAGTG AGTTAAAAAT 600 

TGCAGATGTC ACGATTCGAG ATTGGGACGT CAATGAAGGA TTGACTGGTG 650 

GCAGAATGAT GACCTTTTCT CAAGGGTTCG CTCACTCAAG TAACGTTGGG 700 

ATGACGCTTC TTGAGCAAAA GATGGGAGAT GCTACATGGC TTGATTATCT 750 

AAATCGCTTT AAATTTGGTG TTCCGACTCG TTTTGGCTTG ACGGATGAAT 800 

ATGCAGGCCA ACTTCCAGCT GACAATATCG TAAATATTGC TCAGAGTTCA 850 

TTTGGACAAG GGATTTCAGC GACCCAGACG CAAATGATTC GTGCCTTCAC 900 

GGCTATTGCC AACGATGGAG TCATGTTAGA ACCTAAATTT ATCAGTGCCA 950 

TTTATGATCC AAATGATCAA ACTGCTCGGA AATCACAAAA AGAAGTTGTG 1000 

GGAAATCCTG TGTCTAAAGA TGCAGCGAGC TTGACGCGAA CGCATATGGT 1050 

TTTAGTCGGT ACCGATCCAG TATATGGAAC TATGTATAAT CATAAGACAG 1100 

GGAAACCAAC TGTAACTGTT CCTGGGCAAA ATGTAGCCCT CAAGTCTGGT 1150 

ACGGCTCAGA TTGCCGATGA GAAAAATGGA GGTTACTTAG TTGGTACGAC 1200 
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CAATTACATT TTTTCGGCTG TATCGATGAA 

TTATTCTCTA TGTGACGGTT CAACAGCCTG 

TTGGGGGAAT TTGCCAATCC TATCTTGGAA 

TTCCCTTAAC CTCCAATCTA CCGCTAAAAC 

5 AAAGCGCTTA TGCCATGCCT AGCATCAAGG 

GCGGAAGCCT TACGTCGCAA TATTGTGCAA 

AACAAAAATT AAAGAATCAT CTGTAGAAGA 

ACCAGCAAGT TCTTCTCTTA TCTGATAAAG 

TATGGTTGGA CAAAAGAGAC TGCTGAGACC 

10 AGAACTTGAA TTTGAAGGTT CGGGCTCTAC 



CCCTGCTGAA AATCCTGATT 1250 

AGCATTATTC AGGTATTCAG 13 0 0 

AGGGCAGTGG CTATGAAAGA 1350 

GTTAAATCAG GTAACCAATC 1400 

ACATTTCACC TGGCGATTTG 1450 

CCAATCGTTG TAGGAACAGG 150 0 

AGGGACGGAT CTTGCACCTA 1550 

CAGAGGAAGT TCCAGATATG 1600 

TTTGCTAAGT GGCTCAATAT 1650 

TGTGCAGAAG 1690 



15 



2) INFORMATION FOR SEQ ID NO: 1048 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 104 8 

30 AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTAC CCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 150 

TGGTTTGGAG AGTTCTTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 200 

TTACCTATGA AAAAGAC CGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

3 5 GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 30 0 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 3 50 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 4 50 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

40 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 105 0 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 13 00 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 13 50 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 14 00 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1450 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1500 

60 AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1550 
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AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 
TC 1682 



2 ) INFORMATION FOR SEQ ID NO: 104 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1049 

GCAGCGTTGC GTGATACCGT TGAAAAAACC ATTAAAAACT GTTTGGATTT 50 

TGAAAGGAGA CAGGAGCATG AATAGAATAA AAGTTGCAAT ACTGTTTGGG 100 

GGTTGCTCAG AGGAGCATGA CGTATCGGTA AAATC TGCAA TAGAGATAGC 150 

CGCTAACATT AATAAAGAAA AATACGAGCC GTTATACATT GGAATTACGA 200 

AATCTGGTGT ATGGAAAATG TGCGAAAAAC CTTGCGCGGA ATGGGAAAAC 250 

GACAATTGCT ATTCAGCTGT ACTCTCGCCG GATAAAAAAA TGCACGGATT 300 

ACTTGTTAAA AAGAACCATG AATATGAAAT CAACCATGTT GATGTAGCAT 350 

TTTCAGCTTT GCATGGCAAG TCAGGTGAAG ATGGATCCAT ACAAGGTCTG 400 

TTTGAATTGT CCGGTATCCC TTTTGTAGGC TGCGATATTC AAAGCTCAGC 450 

AATTTGTATG GACAAATCGT TGACATACAT CGTTGCGAAA AATGCTGGGA 500 

TAGCTACTCC CGCCTTTTGG GTTATTAATA AAGATGATAG GCCGGTGGCA 550 

GCTACGTTTA CCTATCCTGT TTTTGTTAAG CCGGCGCGTT CAGGCTCATC 600 

CTTCGGTGTG AAAAAAGTCA ATAGCGCGGA CGAATTGGAC TACGCAATTG 650 

AATCGGCAAG ACAATATGAC AGCAAAATCT TAATTGAGCA GGCTGTTTCG 700 

GGCTGTGAGG TCGGTTGTGC GGTATTGGGA AACAGTGCCG CGTTAGCTGT 750 

TGGCGAGGTG GACCAAATCA GGCTGCAGTA CGGAATCTTT CGTATTCATC 800 

AGGAAGTCGA GCCGGAAAAA GGCTCTGAAA ACGCAGTTAT AACCGTTCCC 850 

GCAGACCTTT CAGCAGAGGA GCGAGGACGG ATACAGGAAA CGGCAAAAAA 900 

AATATATAAA GCGCTCGGCT GTAGAGGTCT AGCCCGTGTG GATATGTTTT 950 

TACAAGATAA CGGCCGCATT GTACTGAACG AAGTCAATAC TCTGCCCGGT 1000 

TTCACGTCAT ACAGTCGTTA TCCCCGTATG ATGGCCGCTG CAGGTATTGC 1050 

ACTTCCCGAA CTGATTGACC GCTTGATCGT ATTAGCGTTA AAGGGGTGAT 110 0 

AAGCATGGAA ATAGGATTTA CTTTTTTAGA TGAAATAGTA CACGGTGTTC 1150 

GTTGGGACGC TAAATATGCC ACTTGGGATA ATTTCACCGG AAAAC CGGTT 1200 

GACGGTTATG AAGTAAATCG CATTGTAGGG ACATACGAAT T 1241 



2 ) INFORMATION FOR SEQ ID NO: 1050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGGAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

10 GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAAC CATGA 3 50 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

15 CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 50 0 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

2 0 TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 80 0 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

25 CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGT CAAT ACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 110 0 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

30 TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCG 124 9 



35 2) INFORMATION FOR SEQ ID NO: 1051 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 72 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus .faecixjm 

(B) STRAIN: R481 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1051 



TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 100 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 150 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 200 

55 GAATTACGAA ATCTGGTGTA .TGGAAAATGT GCGAAAAACC TTGCGCGGAA 250 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 3 00 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 350 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 400 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 450 

60 AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 500 
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10 



15 



ATGCTGGGAT 
CCGGTGGCAG 
AGGCTCATCC 
ACGCAATTGA 
GCTGTTTCGG 
GTTAGTTGTT 
GTATTCATCA 
ACCGTTCCCG 
GGCAAAAAAA 
ATATGTTTTT 
CTGCCCGGTT 
AGGTATTGCA 
AGGGGTGATA 
ACGGTGTTCG 
AAACCGGTTG 
GGCTTGAATC 



AGCTACTCCC 
CTACGTTTAC 
TTCGGTGTGA 
ATCGGCAAGA 
GCTGTGAGGT 
GGCGAGGTGG 
GGAAGTCGAG 
CAGACCTTTC 
ATATATAAAG 
ACAAGATAAC 
TCACGTCATA 
CTTCCCGAAC 
AGCATGGAAA 
TTGGGACGCT 
ACGGTTATGA 
GCTTTTTGAA 



GCCTTTTGGG 
CTATCCTGTT 
AAAAAGTCAA 
CAATATGACA 
CGGTTGTGCG 
ACCAAATCAG 
CCGGAAAAAG 
AGCAGAGGAG 
CGCTCGGCTG 
GGCCGCATTG 
CAGTCGTTAT 
TGATTGACCG 
TAGGATTTAC 
AAATATGCCA 
AGTAAATCGC 
GG 



TTATTAATAA 
TTTGTTAAGC 
TAGCGCGGAC 
GCAAAATCTT 
GTATTGGGAA 
GCTGCAGTAC 
GCTCTGAAAA 
CGAGGACGGA 
TAGAGGTCTA 
TACTGAACGA 
CCCCGTATGA 
CTTGATCGTA 
TTTTTTAGAT 
CTTGGGATAA 
ATTGTAGGGA 



AGATGATAGG 550 

CGGCGCGTTC 600 

GAATTGGACT 650 

AATTGAGCAG 700 

ACAGTGCCGC 750 

GGAATCTTTC 800 

CGCAGTTATA 850 

TACAGGAAAC 900 

GCCCGTGTGG 950 

AGTCAATACT 1000 

TGGCCGCTGC 105 0 

TTAGCGTTAA 1100 

GAAATAGTAC 1150 

TTTCACCGGA 120 0 

CATACGAATT 1250 
1272 



20 2) INFORMATION FOR SEQ ID NO: 1052 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 



TCCCCCGGCA 
GAAAAAACCA 
ATAGAATAAA 
GTATCGGTAA 
ATACGAGCCG 
GCGAAAAACC 
CTCTCGCCGG 
ATATGAAATC 
CAGGTGAAGA 
TTTGTAGGCT 
GACATACATC 
TTATTAATAA 
TTTGTTAAGC 
TAGCGCGGAC 
GCAAAATCTT 
GTATTGGGAA 
GCTGCAGTAC 
GCTCTGAAAA 
CGAGGACGGA 
TAGAGGTCTA 
TACTGAACGA 
CCCCGTATGA 
CTTGATCGTA 
TTTTTTAGAT 
CTTGGGATAA 



TACGGCCTAT 
TTAAAAACTG 
AGTTGCAATA 
AATCTGCAAT 
TTATACATTG 
TTGCGCGGAA 
ATAAAAAAAT 
AACCATGTTG 
TGGATCCATA 
GCGATATTCA 
GTTGCGAAAA 
AGATGATAGG 
CGGCGCGTTC 
GAATTGGACT 
AATTGAGCAG 
ACAGTGCCGC 
GGAATCTTTC 
CGCAGTTATA 
TACAGGAAAC 
GCCCGTGTGG 
AGTCAATACT 
TGGCCGCTGC 
TTAGCGTTAA 
GAAATAGTAC 
TTTCACCGGA 



TATACCGAGC 
TTTGGATTTT 
CTGTTTGGGG 
AGAGATAGCC 
GAATTACGAA 
TGGGAAAACG 
GCACGGATTA 
ATGTAGCATT 
CAAGGTCTGT 
AAGCTCAGCA 
ATGCTGGGAT 
CCGGTGGCAG 
AGGCTCATCC 
ACGCAATTGA 
GCTGTTTCGG 
GTTAGCTGTT 
GTATTCATCA 
ACCGTTCCCG 
GGCAAAAAAA 
ATATGTTTTT 
CTGCCCGGTT 
AGGTATTGCA 
AGGGGTGATA 
ACGGTGTTCG 
AAACCGGTTG 



AAGCGTTGCG 
GAAAGGAGAC 
GTTGCTCAGA 
GCTAACATTA 
ATCTGGTGTA 
ACAATTGCTA 
CTTGTTAAAA 
TTCAGCTTTG 
TTGAATTGTC 
ATTTGTATGG 
AGCTACTCCC 
CTACGTTTAC 
TTCGGTGTGA 
ATCGGCAAGA 
GCTGTGAGGT 
GGCGAGGTGG 
GGAAGTCGAG 
CAGACCTTTC 
ATATATAAAG 
ACAAGATAAC 
TCACGTCATA 
CTTCCCGAAC 
AGCATGGAAA 
TTGGGACGCT 
ACGGTTA 



TGATACCGTT 50 

AGGAGCATGA 100 

GGAGCATGAC 150 

ATAAAGAAAA 200 

TGGAAAATGT 250 

TTCAGCTGTA 3 00 

AGAACCATGA 350 

CATGGCAAGT 400 

CGGTATCCCT 450 

ACAAATCGTT 500 

GCCTTTTGGG 550 

CTATCCTGTT 600 

AAAAAGTCAA 650 

CAATATGACA 700 

CGGTTGTGCG 750 

ACCAAATCAG 800 

CCGGAAAAAG 850 

AGCAGAGGAG 900 

CGCTCGGCTG 950 

GGCCGCATTG 1000 

CAGTCGTTAT 1050 

TGATTGACCG 1100 

TAGGATTTAC 1150 

AAATATGCCA 1200 
1237 
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2 ) INFORMATION FOR SEQ ID NO: 1053 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053 

20 CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 100 

AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 200 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 250 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 3 00 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 350 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 4 00 

GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 450 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

30 TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 550 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 

GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 650 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 70 0 

TTAATTGAGC AGGC TGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 750 

35 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 

ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 900 

GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 950 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

40 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 1100 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT 1200 

AATTTCACCG GAAAACCGGT TGACGGGTAT GAAAGTAAAT CGCATTGTAG 1250 

45 GGACATTCGA ATT 12 63 



50 



2) INFORMATION FOR SEQ ID NO: 1054 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY; Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus fa.eca.lis 
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STRAIN: R610 



10 



15 



20 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 



CGGCCTATTA 
AAAAACTGTT 
TTGCAATACT 
TCTGCAATAG 
ATACATTGGA 
GCGCGGAATG 
AAAAAAATGC 
CCATGTTGAT 
GATCCATACA 
GATATTCAAA 
TGCGAAAAAT 
ATGATAGGCC 
GCGCGTTCAG 
ATTGGACTAC 
TTGAGCAGGC 
AGTGCCGCGT 
AATCTTTCGT 
CAGTTATAAC 
CAGGAAACGG 
CCGTGTGGAT 
TCAATACTCT 
GCCGCTGCAG 
AGCGTTAAAG 
AATAGTACAC 
TCACCGGAAA 



TNCCGAGCAA 
TGGATTTTGA 
GTTTGGGGGT 
AGATAGCCGC 
ATTACGAAAT 
GGAAAACGAC 
ACGGATTACT 
GTAGCATTTT 
AGGTCTGTTT 
GCTCAGCAAT 
GCTGGGATAG 
GGTGGCAGCT 
GCTCATCCTT 
GCAATTGAAT 
TGTTTCGGGC 
TAGTTGTTGG 
ATTCATCAGG 
CGTTCCCGCA 
CAAAAAAAAT 
ATGTTTTTAC 
GCCCGGTTTC 
GTATTGCACT 
GGGTGATAAG 
GGTGTTCGTT 
ACCGGTTGAC 



GCGTTGCGTG 
AAGGAGACAG 
TGCTCAGAGG 
TAACATTAAT 
CTGGTGTATG 
AATTGCTATT 
TGTTAAAAAG 
CAGCTTTGCA 
GAATTGTCCG 
TTGTATGGAC 
CTACTCCCGC 
ACGTTTACCT 
CGGTGTGAAA 
CGGCAAGACA 
TGTGAGGTCG 
CGAGGTGGAC 
AAGTCGAGCC 
GACCTTTCAG 
ATATAAAGCG 
AAGATAACGG 
ACGTCATACA 
TCCCGAACTG 
CATGGAAATA 
GGGACGCTAA 
GGTTATAAGT 



ATACCGTTGA 
GAGCATGAAT 
AGCATGACGT 
AAAGAAAAAT 
GAAAATGTGC 
CAGCTGTACT 
AACCATGAAT 
TGGCAAGTCA 
GTATCCCTTT 
AAATCGTTGA 
CTTTTGGGTT 
ATCCTGTTTT 
AAAGTCAATA 
ATATGACAGC 
GTTGTGCGGT 
CAAATCAGGC 
GGAAAAAGGC 
CAGAGGAGCG 
CTCGGCTGTA 
CCGCATTGTA 
GTCGTTATCC 
ATTGACCGCT 
GGATTTACTT 
ATATGCCACT 
AA 



AAAAACCATT 50 

AGAATAAAAG 100 

ATCGGTAAAA 150 

ACGAGCCGTT 2 00 

GAAAAACCTT 250 

CTCGCCGGAT 3 00 

ATGAAATCAA 350 

GGTGAAGATG 400 

TGTAGGCTGC 450 

CATACATCGT 500 

ATTAATAAAG 550 

TGTTAAGCCG 600 

GCGCGGACGA 650 

AAAATCTTAA 700 

ATTGGGAAAC 750 

TGCAGTACGG 800 

TCTGAAAACG 850 

AGGACGGATA 900 

GAGGTCTAGC 950 

CTGAACGAAG 1000 

CCGTATGATG 1050 

TGATCGTATT 1100 

TTTTAGATGA 1150 

TGGGATAATT 1200 
1232 



2) INFORMATION FOR SEQ ID NO: 1055 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Ent-erococcus gallinarum 
45 (B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 

TACCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT AAAAACTGTT 50 

50 TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG TTGCAATACT 100 

GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA TCTGCAATAG 150 

AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT ATACATTGGA 2 00 

ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT GCGCGGAATG 250 

GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT AAAAAAATGC 300 

55 ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA CCATGTTGAT 350 

GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG GATCCATACA 400 

AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC GATATTCAAA 450 

GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT TGCGAAAAAT 500 

GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG ATGATAGGCC 550 

60 GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG GCGCGTTCAG 600 

597 
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GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC 650 

GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA TTGAGCAGGC 700 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 750 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 850 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCAATACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 110 0 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 1150 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 12 00 

ACCGGTTGAC GGTTAGAA 1218 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 1056 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 12 65 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faeciura 

(B) STRAIN: R688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1056 



AATCACACCG CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

3 5 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 2 00 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 250 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 300 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 350 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 450 

CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 50 0 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 700 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 750 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 
AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 
AGCGAGGACG GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 100 0 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 1100 

55 CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 



598 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



2) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



15 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 



TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

2 0 ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 3 50 

2 5 ATATGAAATC AAC C ATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 4 00 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

30 TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG 'GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

35 GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 100 0 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 105 0 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 110 0 

40 CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCGC 1250 

ATTGTAGGGA CATACGAAT 1269 



2) INFORMATION FOR SEQ ID NO: 1058 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1169 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R42 0 



599 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058 

CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

5 ATTCTCCAGA ATACTCAGTG TCACTAACCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTAACC CGCTGAAATA TGAAGTAATG AC CATTGGCA TCGCACCAAC 2 00 

AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 2 50 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

10 CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 4 00 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 50 0 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

15 GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

TTGACGATTG GTGCTTGTGA TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

20 CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC ACAGCTGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

25 AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTGCTA 1169 



30 2) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

600 
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TTGACGATTG GTGCTTGTGA TGCGATTTCT 
TTTTGAAGAG AAATACCAAT TAATCAGCGC 
CATTGCCTCT CGCGCTTGAA TCACAGATCA 
TATCGAAACT TGGGATTGAC GGGTCTGGCT 
5 CAATCAAGGA GCGATTTATT TAAACGAAAT 
CTGGGCACTC CCGCTACCCA GCTATGATGG 
GAAATATTAG TAGAGCAATT GATTGCACTG 
AACACATTAC AATTGATCAA TAAAAACCAT 
GCCCCCGCAC TTAGTG 



CTTGTCGACG GTTTTTTTGA 800 

CACGATCACT GTCCCAGCAC 850 

AGGAGCAGGC ACAGCTGCTT 90 0 

CGAATCGATT TTTTCGTCAC 950 

CAACACCATG CCGGGATTTA 1000 

CGGAAGTCGG GTTATCCTAC 105 0 

GCAGAGGAGG ACAAACGATG 1100 

CCATTGAAAA AAAATCAAGA 1150 

1166 



2) INFORMATION FOR SEQ ID NO: 1060 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: ATCC 2578 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

* AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC . 50 

3 0" GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTACTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 3 00 

35 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AGCTGCCTTA TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 550 

40 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 800 

45 GCGCCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTGGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 

50 

2) INFORMATION FOR SEQ ID NO: 1061 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



601 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 
5 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061 

AATATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAGTACGCC 50 

10 GTTTCTTTAG CCTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CGATGACTAT GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTATTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 200 

TTGGATACGA AACATAAACA GAAAATC C AG CCGCTTTTTG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AACAAACGTT GGTTCCTGAT GTTTTATTTC 3 00 

15 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AACTACCTTA TGTAGGCTGC GGGGTGGCAG CCTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC AGCAGAAGCG ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGATCAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTT CCGGTTTTTT TTAAGCCTAA 550 

20 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCAGC AGTGCTCCTA 650 

CAAAAAAATA TTGC CGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCTAT TTCATTAGTA GACGGCTTTT 75 0 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAGAT CACCGTTCCT 8 00 

25 GCACCATTGC CTGAAACGAT TGAAAC CAAG GTCAAAGAAC AAGCTCAGCT 850 

GCT CTATCGT AGTCTTTGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGATCA AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGA TCGGCTTATC 1000 
CTATCAAGAA CTACTACAAA AACTGCTTGT . 1030 

30 



2) INFORMATION FOR SEQ ID NO: 1062 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflaws 
45 (B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

50 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 2 00 

TGTTGGATAC GAAACATAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGC TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

55 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTGCCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGCT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCAAGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TCCCAGTTTT CTTTAAGCCT 550 

60 AATGAAGCGG GCTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

602 
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AATCGCTTCT GCCTTAAAAG AAGCCTTTAC 
TACAAAAAAA TATTGCCGGT GTTGAGATCG 
GACTCTTTGA CTGTCGGTGC TTGTGACGCC 
TTTCGATTTT GAAGAAAAGT ACCAGCTGAT 
5 CTGCGCCATT GCCTGAAACG ATTGAAACCA 
CTGCTCTATC GTAGTCTTGG TCTTAAAGGT 
TGTCACGGAG CGAGGAGAAC TATACTTGAA 
GCTTTACGAG TCACTCCCGC TATCCTGCCA 
TCCTATCAAG AACTACTACA AAAACTGCTT 



TTATTGTTCC GCAGTGCTCC 6 50 

GTTGCGGTAT TTTGGGCAAC 700 

ATTTCATTAG TAGACGGCTT 750 

CAGCGCCAA& ATCACCGTCC 800 

AGGTCAAAGA ACAAGCTCAG 850 

CTTGCTCGCA TCGACTTTTT 900 

TGAAATCAAT ACTATGCCGG 950 

TGATGGCAGC GGTCGGCTTA 10 00 

G 1031 



2) INFORMATION FOR SEQ ID NO: 1063 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

30 CGTTTCTTTA GCTTCAGCAA CTAGCGCAAT CGAAGCACTC CAATCATCTC 100 

CCTATGACTA CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

TGGTACTTGT ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 2 00 

GTTGGATACG AAACATAAAC AGAAAATACA GCCGCTATTT GAAGGAAACG 250 

GCTTTTGGCT AAGTGAAGAG CAGCAAACGT TAGTACCTGA TATTTTATTT 300 

35 CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 350 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 400 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 450 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 55 0 

40 ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGCCGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

ACTCTTTGAC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 8 00 

45 TGCGCCATTG CCTGAAACGA TTGAAACCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT ATACTTGAAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 103 0 

50 



2) INFORMATION FOR SEQ ID NO; 1064 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

603 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

L0 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 2 00 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 3 00 

L5 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCACGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 55 0 

20 AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 650 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 70 0 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 750 

TTTCGATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 80 0 

25 CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 850 

CTGCTCTATC GTAGTCTTGG ACTTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAA.C TATACTTGAA TGAAATCAAT ACTATGC CGG 95 0 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 100 0 

TCCTATCAAG AACTACTACA AAAACTACTT GT 1032 

30 



2) INFORMATION FOR SEQ ID NO: 1065 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
45 (B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGGCAATT CACCGGAATA 50 

50 CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATC CGAC AAGACACGTG 200 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 300 

55 TTTCCCATTA TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGCTGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 4 00 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 4 50 

GTACAAAGTG CTCCTACGAT TCTCTTGACA AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

60 CTAATGAAGC GGGCTCCTCA AAAGGGATCA CTAAAGTCAC CTGCGTTGAA 600 

604 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GAAATCGCTT CTGCCTTAAA AGAAGCCTTT 
CCTACAAAAA AATATTGCCG GTGTTGAGAT 
ACGACTCTTT GACTGTCGGT GCTTGTGACG 
TTTTTCGATT TTGAAGAAAA GTACCAGCTG 
5 CCCTGCGCCA TTGCCTGAAA CGATTGAAAC 
AGCTGCTCTA TCGTAGTCTT GGTCTTAAAG 
TTTGTCACGG ATCAAGGAGA ACTATACTTG 
GGGCTTTACG AGTCACTCCC GCTATCCTGC 
TATCCTATCA AGAACTACTA CAAAAACTGC 



PCT/CA00/01150 

ACTTATTGTT CCGCAGTGCT 65 0 

CGGTTGCGGT ATTTTGGGCA 700 

CCATTTCATT AGTAGACGGC 750 

ATCAGCGCCA AAATCACCGT 80 0 

CAAGGTCAAA GAACAAGCTC 850 

GTCTTGCTCG CATCGACTTT 90 0 

AATGAAATCA ATACTATGCC 950 

CATGATGGCA GCGGTCGGCT 100 0 

TTGT 1034 



2) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 6 

CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATACACCG 5 0 

3 0 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 10 0 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 2 00 

TGGATACGAA ACATACACAG AAAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA GCAAACGTTG GTACCTGATG TTTTATTTCC 3 00 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 350 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 400 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 450 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 50 0 

TCGAAGCCTT TATCCAGACC CATGGCTTTC CAGTTTTCTT TAAGCCTAAT 550 

4 0 GAAGCGGGCT CCTCAAAAGG GATCACAAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 650 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 70 0 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGCTTTTT 750 

CGATTTTGAA GAAAAGTACC AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAACGATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

CTCTATCACA GTCTTGGTCT TAAAGGACTT GCTCGCATCG ACTTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 950 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 100 0 

TATCAAGAAT TA 1012 

50 



2) INFORMATION FOR SEQ ID NO: 1067 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

605 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R481 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 

CTTACGCTTT ATCGATTAGA CACGGGNAGC TTGTCCAATG GGRAGCCGAT 50 

10 TTGATTTTAT GGATGAACGC TCTCATCATG CGGCAAATGG AATATCATGC 100 

AATGAAGCGC AAAATCGCAG ACGTTTGCGC TCCATCATGG AAAACAGTGG 150 

GTTTGAAGCA TATAGCCTCG AATGGTGGCA CTATGTATTA AGAGACGAAC 2 00 

CATACCCCAA TAGCTATTTT GATTTCCCCG TTAAATAAAC TTTTAACCGT 250 

TGCACGGACA AACTATATAA GCTAACTCTT TCGGCAGGAA ACCCGACGTA 3 00 

15 TGTAACTGGT TCTTAGGGAA TTTATATATA GTAGATAGTA TTGAAGATGT 350 

AAGGCAGAGC GATATTGCGG TCATTATCTG CGTGCGCTGC GGCAAGATAG 4 00 

C CTGATAATA AG AC TG AT C G CATAGAGGGG TGGTATTTCA CACCGCCCAT 450 

TGTCAACAGG CAGTTCAGCC TCGTTAAATT CAGCATGGGT ATCACTTATG 500 

AAAATTCATC TACATTGGTG ATAATAGTAA ATCCAGTAGG GCGAAATAAT 550 

2 0 TGACTGTAAT TTACGGGGCA AAACGGCACA ATCTCAAACG AGATTGTGCC 600 

GTTTAAGGGG AAGATTCTAG AAATATTTCA TACTTCCAAC TATATAGTTA 6 50 

AGGAGGAGAC TGAAAATGAA GAAGTTGTTT TTTTTATTGT TATTGTTATT 700 

CTTAATATAC TTAGGTTATT G 721 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1068 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 



ATTTTTAAGG ATGAACGCTC TTCATCATGC GGCAAATGGA ATATCATGCA 50 

ATGAAGCGCA AAATCGCAGA CGTTTGCGCT CCATCATGGA AAACAGTGGG 100 

45 TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC TATGTATTAA GAGACGAACC 150 

ATACCCCAAT AGCTATTTTG ATTTCCCCGT TAAATAAACT TTTAACCGTT 2 00 

GCACGGACAA ACTATATAAG CTAACTCTTT CGGCAGGAAA CCCGACGTAT 250 

GTAACTGGTT CTTAGGGAAT TTATATATAG TAGATAGTAT TGAAGATGTA 300 

AGGCAGAGCG ATATTGCGGT CATTATCTGC GTGCGCTGCG GCAAGATAGC 350 

50 CTGATAATAA GACTGATCGC ATAGAGGGGT GGTATTTCAC ACCGCCCATT 400 

GTCAACAGGC AGTTCAGCCT CGTTAAATTC AGCATGGGTA TCACTTATGA 450 

AAATTCATCT ACATTGGTGA TAATAGTAAA TCCAGTAGGG CGAAATAATT 500 

GACTGTAATT TACGGGGCAA AACGGCACAA TCTCAAACGA GATTGTGCCG 550 

TTTAAGGGGA AGATTCTAGA AATATTTCAT ACTTCCAACT ATATAGTTAA 600 

55 GGAGGAGACT GAAAATGAAG AAGTTGTTTT TTTTATTGTT ATTGTTATTC 650 

TTAATATACT TAGGTTAT 668 



60 2) INFORMATION FOR SEQ ID NO: 1069 

606 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069 



CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 100 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 200 

2 0 TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 250 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 300 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 350 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 400 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 450 

25 GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 500 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 700 

30 TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA 760 



35 2) INFORMATION FOR SEQ ID NO: 1070 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R610 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 0 



AAAAGGAATA CGGGGCTTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 2 00 

55 TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT 250 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA 300 

ATAAACTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 350 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 4 00 

ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

60 CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

607 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ATTTCACACC GCCCATTGTC AACAGGCAGT 

ATGGGTATCA CTTATGAAAA TTCATCTACA 

AGTAGGGCGA AATAATTGAC TGTAATTTAC 

CAAACGAGAT TGTGCCGTTT AAGGGGAAGA 

5 TCCAACTATA TAGTTAAGGA GGAGACTGAA 

TTATTGTTAT TGTTATTCTT AATATACTTA 
A 



PCT/CAOO/01150 

TCAGCCTCGT TAAATTCAGC 550 

TTGGTGATAA TAGTAAATCC 600 

GGGGCAAAAC GGCACAATCT 650 

TTCTAGAAAT ATTTCATACT 7 00 

AATGAAGAAG TTGTTTTTTT 750 

GGTTATGACT ACGTTAATGA 8 00 

801 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1071 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 711 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE :- Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 71 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 10 0 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

3 0 ACTATGTATT AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC 2 00 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 250 

TTCGGCAGGA AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT 300 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 400 

35 GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 450 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

40 TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G 711 



45 2) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 

608 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCCATTGATC TTACGCTTTA TCGATTAGAC 
GGGGAGCCGA TTTGATTTTA TGGATGAACG 
GAATATCATG CAATGAAGCG CAAAATCGCA 
GAAAACAGTG GGTTTGAAGC ATATAGCCTC 
5 AAGAGACGAA CCATACCCCA ATAGCTATTT 
CTTTTAACCG TTGC AGGGAC AAACTATATA 
AACCCGACGT ATGTAACTGG TTCTTAGGGA 
ATTGAAGATG TAAGGCAGAG CGATATTGCG 
CGGCAAGATA GCCTGATAAT AAGACTGATC 

10 ACACCGCCCA TTGTCAACAG GCAGTTCAGC 
TATCACTTAT GAAAATTCAT CTACATTGGT 
GGCGAAATAA TTGACTGTAA TTTACGGGGC 
GAGATTGTGC CGTTTAAGGG GAAGATTCTA 
CTATATAGTT AAGGAGGAGA CTGAAAATGA 

15 TTATTGTTAT TCTTAATATA CTTAGGTTAT 
G 



PCT/CA00/01150 

ACGGGTNAGC TTGTACCAAT 50 

CTCTCATCAT GCGGCAAATG 100 

GACGTTTGCG CTCCATCATG 150 

GAATGGTGGC ACTATGTATT 2 00 

TGATTTCCCC GTTAAATAAA 2 50 

AGCTAACTCT TTCGGCAGGA 300 

ATTTATATAT AGTAGATAGT 35 0 

GTCATTATCT GCGTGCGCTG 4 00 

GCATAGAGGG GTGGTATTTC 4 50 

CTCGTTAAAT TCAGCATGGG 500 

GATAATAGTA AATCCAGTAG 550 

AAAACGGCAC AATCTCAAAC 60 0 

GAAATATTTC ATACTTCCAA 650 

AGAAGTTGTT TTTTTTATTG 70 0 

GACTACGTTA ATGAAGCACT 75 0 

751 



20 2) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 5 0 

GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA 15 0 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC 2 00 

40 GTTGCACGGA CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG 2 50 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 3 00 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT 3 50 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 4 00 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 45 0 

45 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 50 0 

ATTGACTGTA ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT 600 

TAAGGAGGAG ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 65 0 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 



2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



609 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 1074 

ATCGATTAGA CACGGGTGAG CTTGTACCAA TGGGGAGCCG ATTTGATTTT 50 

10 ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT GCAATGAAGC 100 

GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT GGGTTTGAAG 150 

CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC 200 

AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC GTTGCACGGA 250 

CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG TATGTAACTG 300 

15 GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT GTAAGGCAGA 350 

GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT AGCCTGATAA 4 00 

TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC ATTGTCAACA 450 

GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA TGAAAATTCA 500 

TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA ATTGACTGTA 550 

20 ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG CCGTTTAAGG 600 

GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT TAAGGAGGAG 650 

ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA TTCTTAATAT 7 00 

ACTTAGGTTA TGACTACGTT AATGAAGCAC TG 73 2 



25 



40 



2 ) INFORMATION FOR SEQ ID NO: 1075 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 67 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus galllnarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075 



TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 10 0 

45 AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 200 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 250 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

50 CATAGAGGGG TGGTATTTCA CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGCATGGGT ATCACTTATG AAAATTCATC TACATTGGTG 450 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TGACTGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTCTAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

55 GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 650 

ACTACGTTAA TGAAGCACTG 67 0 



60 2) INFORMATION FOR SEQ ID NO: 1076 

610 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: DG131/3 serotype 0X3 :H8 

(C) ACCESSION NUMBER: Z36901 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1076 

I 

ATGAAAATAA TAATTTTTAG AGTGCTAACT TTTTTCTTTG TTATCTTTTC 50 

TGTTAATGTG GTTGCGAAGG AATTTACCTT AGATTTCTCG ACAGCAAAGA 100 

CGTATG TAGA TTCGCTGAAT GTCATTCGCT CTGCAATAGG TACTCCATTA 150 

2 0 CAGACTATTT CATCAGGAGG TACGTCTTTA CTGATGATTG ATAGTGGCAC 200 

AGGGGATAAT TTGTTTGCAG TTGATGTCAG AGGGATAGAT CCAGAGGAAG 250 

GGCGGTTTAA TAATCTACGG CTTATTGTTG AACGAAATAA TTTATATGTG 300 

ACAGGATTTG TTAACAGGAC AAATAATGTT TTTTATCGCT TTGCTGATTT 3 50 

TTCACATGTT ACCTTTCCTG GTACAACTGC GGTTACATTG TCTGGTGACA 4 00 

2 5 GTAGCTATAC CACGTTACAG CGTGTTGCGG GGATCAGTCG TACGGGGATG 450 

CAGATAAATC GCCATTCGTT GACTACTTCT TATCTGGATT TAATGTCGCA 500 

TAGCGGAACC TCACTGACGC AGTCTGTGGC AAGAGCGATG TTACGGTTTG 550 

TTACTGTGAC AGCTGAAGCT TTACGTTTTC GGCAAATTCA GAGGGGATTT 600 

CGTACAACAC TTGATGATCT CAGTGGGCGT TCTTATGTAA TGACTGCTGA 650 

3 0 AGATGTTGAT CTTACGTTGA ACTGGGGAAG GTTGAGTAGT GTCCTGCCTG 700 

ACTATCATGG ACAAGACTCT GTTCGTGTTG GAAGAATTTC TTTTGGAAGT 750 

GTTAATGCAA TTCTGGGTAG CGTGGCATTA ATACTGAATT GTCATCATCA 800 

TGCATCGCGA GTTGCCAGAA TTGTACCTAA TGAGTTTCCT TCTATGTGCC 850 

CGGTAGATGG AAGAGTGCGT GGGATTACGC ACAATAAAAT ATTGTGGGAC 900 

35 TCATCCACTC TGGGGGCAAT TTTGATACGC AGGGCTATTA GCAGTTGA 948 



2) INFORMATION FOR SEQ ID NO: 1077 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 

55 

CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 

GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GACTTTTCGA 100 

CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 

ACCCCTCTTG AACATATATC TCAGGGGACC ACATCGGTGT CTGTTATTAA 200 

60 CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 
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TCTATCAGGC 
TTATATGTGG 
TTCAGATTTT 
CAACGGACAG 
5 TCCGGAATGC 
AATGGAGTTC 
TGCGTTTTGT 
AGAGAATTTC 
GCCGGGAGAC 

10 TTCCGGAGTA 
AATAATATAT 
TCATCAGGGG 
AATGTCAGAT 
TGGGAAAGTA 

15 ATATACAACG 
GGTTTTATTT 
AAGGTAAAAT 
AAGGTTGACG 
ACTGCAAAGT 

20 CCTGTGAATC 
GGCATAACC 



GCGTTTTGAC 
CCGGGTTCGT 
ACACATATAT 
CAGTTATACC 
AAATCAGTCG 
AGTGGTAATA 
CACTGTCACA 
GTCAGGCACT 
GTGGACCTCA 
TCGGGGAGAG 
CAGCGATACT 
GCGCGTTCTG 
AACTGGCGAC 
ATACAGCTGC 
GGTAAATAAA 
GCATTAGTTT 
TGAGTTTTCC 
GGAAAGAATA 
GCTCAGCTGA 
AGGCTCCGGA 



CATCTTCGGC 
TAATACGGCA 
CAGTGCCCGA 
ACTCTGCAAC 
TCACTCACTG 
CAATGACCAG 
GCAGAAGCCT 
GTCTGAAACT 
CTCTGAACTG 
GATGGTGTCA 
GGGTACTGTG 
TTCGCGCCGT 
AGGCCCGTTA 
AGCGTTTCTG 
GGAGTTAAGT 
CTGTTAATGC 
AAGTATAATG 
CTGGACCAGT 
CAGGAATGAC 
TTTGCTGAAG 



TGATTATTGA 
ACAAATACTT 
TGTGACAACG 
GTGTCGCAGC 
GTTTCATCAT 
AGATGCATCC 
TACGCTTCAG 
GCTCCTGTGT 
GGGGCGAATC 
GAGTGGGGAG 
GCCGTTATAC 
GAATGAAGAG 
TAAAAATAAA 
AACAGAAAGT 
ATGAAGAAGA 
AATGGCGGCG 
AGGATGACAC 
CGCTGGAATC 
TGTCACAATC 
TGCAGTTTAA 



GCAAAATAAT 3 00 

TCTACCGTTT 350 

GTTTCCATGA 4 00 

GCTGGAACGT 4 50 

ATCTGGCGTT 500 

AGAGCAGTTC 550 

GCAGATACAG 60 0 

ATACCATGAC 650 

AGCAATGTGC 700 

AATATCCTTT 750 

TGAATTGCCA 800 

AGTCAACCAG 850 

CAATACATTA 900 

CACAGTTTTT 950 

TGTTTATGGC 1000 

GATTGTGCTA 1050 

ATTTACAGTG 1100 

TGCAACCGTT 1150 

AAATCCAGTA 1200 

TAATGACTGA 1250 
1259 



25 2) INFORMATION FOR SEQ ID NO: 1078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078 

AGTTCTGCGT TTTGTCACTG TC 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1079 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079 
CGGAAGCACA TTGCTGATT 19 

2 ) INFORMATION FOR SEQ ID NO: 1080 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 25 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
TATAGCTACT GTCACCAGAC AATGT 25 



2) INFORMATION FOR SEQ ID NO: 1081 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 
25 ATGTCAGAGG GATAGATCCA 20 



2) INFORMATION FOR SEQ ID NO : 1082 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 

40 

TTGARCRAAA TAATTTATAT GTG 23 

45 2 ) INFORMATION FOR SEQ ID NO : 1083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 

TGATGATGRC AATTCAGTAT 20 
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2) INFORMATION FOR SEQ ID NO: 1084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 



2) INFORMATION FOR SEQ ID NO: 1085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085 
CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 



2) INFORMATION FOR SEQ ID NO: 1086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 
CTACTCCCGC CTTTTGGGTT 



2 ) INFORMATION FOR SEQ ID NO: 108 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 
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20 



5 2 ) INFORMATION FOR SEQ ID NO : 1088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088 

TGCCGTTTCC TGTATCCGTC 20 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 1089 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089 
ATCCACACGG GCTAGACCTC 2 0 

2 ) INFORMATION FOR SEQ ID NO: 1090 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090 
AATAGCGCGG ACGAATTGGA C 21 

50 

2 ) INFORMATION FOR SEQ ID NO: 1091 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091 
AACGCGGCAC TGTTTCCCAA 



2) INFORMATION FOR SEQ ID NO: 1092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 
TCGGCAAGAC AATATGACAG C 



2) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa-165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 



TAACGGGCGT 
AATCAATATA 
ACAAATAAAA 
CGATTCTGGT 
TTTTAAAGTA 
AATCATGAAA 
TTACAACAAC 
AAACATCCTA 



CTCGATAGAA 
CGATTTTGCT 
ATTAATAGGC 
TTTTTAAATT 
TAGGTATAAA 
CAATTAACTA 
ACTGATTGGC 
TTGATTTTGT 



AAACACGTGA 
GAAGCTGTAA 
AACTTAACCA 
CAAAAAGTTT 
ATACAATTGA 
AGCCTTTATA 
GCGTTACTAT 
GGACGCCCGT 



AAATCCCAAT 
CAAAAGGTAT 
GAATCGTTAA 
TCTAAAAAAT 
TTAAAATAGT 
CTTTTACCTA 
TATATTTGCC 
T 



GATTATAAAC 50 

TAAGGAACAA 100 

AACTATATGA 150 

TTACCTGCTT 2 00 

AAAGGAAATG 250 

TTACTTTTTA 300 

AATCACAGGT 350 

381 



2) INFORMATION FOR SEQ ID NO: 1094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 94 
GGTAAAACAG GTACTTCTAA CTA 23 

5 

2 ) INFORMATION FOR SEQ ID NO: 1095 

10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095 
20 CGATAGAAGC AGCAGGACAA 2 0 



25 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1096 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096 

CTGATGGATG CGGAAGATAC 20 



2) INFORMATION FOR SEQ ID NO: 1097 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097 
TCYTCAAAAG GGATCACWAA AGTMAC 26 

2) INFORMATION FOR SEQ ID NO: 1098 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
TCTTCAAAAT CGAAAAAGCC GTC 2 3 

2) INFORMATION FOR SEQ ID NO: 1099 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 
TCAAAAGGGA TCACWAAAGT MAC 23 

25 

2) INFORMATION FOR SEQ ID NO: 1100 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 
40 GTAAAKCCCG GCATRGTRTT GATTTC 26 



2) INFORMATION FOR SEQ ID NO: 1101 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 

55 

GACGGYTTTT TYGATTTTGA AGA 23 



60 2) INFORMATION FOR SEQ ID NO: 1102 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1102 
AAAAARTCGA TKCGAGCMAG ACC 



2) INFORMATION FOR SEQ ID NO: 1103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 



2) INFORMATION FOR SEQ ID NO: 1104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
GGATCAACAC AGTAGAACCG 



2) INFORMATION FOR SEQ ID NO: 1105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105 
CTCCTACGAT TCTCTTGAYA AATCA 

619 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106 
CAACCGATCT CAACACCGGC AAT 



2) INFORMATION FOR SEQ ID NO: 1107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 
CTCATTTGAC TTCCTCCTTT GCT 



2) INFORMATION FOR SEQ ID NO: 1108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 



2) INFORMATION FOR SEQ ID NO : 110 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
ACATCGTGAT CGCTAAAAGG AGC 



2 ) INFORMATION FOR SEQ ID NO: 1110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
ACGAGAAAGA CAACAGGAAG ACC 



2) INFORMATION FOR SEQ ID NO: 1111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111 
CTTTTTCCGG CTCGWYTTCC TGATG 



2) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112 
GGCTGYGATA TTCAAAGCTC 



2) INFORMATION FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 



2) INFORMATION FOR SEQ ID NO: 1114 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 
TCWGAGCCTT TTTCCGGCTC G 



2) INFORMATION FOR SEQ ID NO: 1115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 
TTTCGGGCTG TGAGGTCGGB TGHGCG 



2 ) INFORMATION FOR SEQ ID NO: 1116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 
TTTCGGGCTG TGAGGTCGGB TGHGCGG 



2) INFORMATION FOR SEQ ID NO: 1117 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

LO (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: 94 

(C) ACCESSION NUMBER: U94526 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 

AAATTCGATC CGCACTACAT CGGAATTACA AAAAACGGTG TATGGAAGCT 50 

ATGCAAGAAG CCATGTACGG AATGGGAAGC CGACAGTCTC CCCGCCATAC 10 0 

TCTCCCCGGA TAGGAAAACG CATGGGCTGC TTGTCATGAA AGAAAGCGAA 150 

20 TACGAAACAC GGCGTATTGA TGTGGCTTTC CCGGTTTTGC ATGGCAAATG 2 00 

CGGGGAGGAT GGTGCGATAC AGGGGCTGTT TGTATTGTCT GGTATCCCCT 250 

ATGTGGGCTG TGATATTCAA AGCT CCGCAG CTTGCATGGA CAAATCACTG 3 00 

GCCTACATTC TTACAAAAAA TGCGGGCATC GCCGTTCCCG AATTTCAAAT 350 

GATTGATAAA GGTGACAAGC CGGAGGCGGG TGCGCTTACC TACCCTGTCT 400 

25 TTGTGAAGCC GGCACGGTCA GGTTCGTCCT TTGGCGTAAC CAAAGTAAAC 450 

GGTACGGAAG AACTTAACGC TGCGATAGAA GCGGCAGGAC AATATGATGG 500 

AAAAATCTTA ATTGAGCAAG CGATTTCGGG CTGTGAGGTC GGGTGTGCGG 550 

TCATGGGGAA CGAGGATGAT TTGATTGTCG GCGAAGTGGA TCAAATCCGG 600 

CTGAGCCACG GTATCTTCCG CATCCATCAG GAAAACGAGC CGGAAAAAGG 65 0 

3 0 CTCAGAAAAT GCGATGATTA CAGTTCCCGC AGACATTCCG GTCGAGGAAC 7 00 

GAAATCGGGT GCAGGAAACG GCAAAGAAAG TATATCGGGT GCTTGGATGC 750 

AGAGGGCTTG CCCGTGTTGA TCTTTTTTTG CAGGAGGATG GCGGCATCGT 800 

T 801 



35 



2 ) INFORMATION FOR SEQ ID NO: 1118 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG 24 

50 



2) INFORMATION FOR SEQ ID NO: 1119 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 
TTTCGGGCTG TGAGGTCGGB TGHGC 



2) INFORMATION FOR SEQ ID NO: 112 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120 
TTTCGGGCTG TGAGGTCGGB TGHG 



2 ) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1121 
TGTTTGWATT GTCYGGYATC CC 



2) INFORMATION FOR SEQ ID NO: 1122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
TGGTGCATTG CTACGTGG 



2) INFORMATION FOR SEQ ID NO: 1123 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123 
TTTCGGGCTG TGAGGTCGGB TG 



2) INFORMATION FOR SEQ ID NO: 1124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124 
GATTTGRTCC ACYTCGCCRA CA 



2 ) INFORMATION FOR SEQ ID NO: 1125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1125 
ACTCACAACT GGGATGGATG 



2 ) INFORMATION FOR SEQ ID NO: 112 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 6 
TTATGGTTGT GCTGGTTGAG G 
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2) INFORMATION FOR SEQ ID NO: 1127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127 
KCAAAYGCCA TTTCAAGTAA 



2) INFORMATION FOR SEQ ID NO: 1128 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 8 
GACGACYTTA TKGATATACA 



2 ) INFORMATION FOR SEQ ID NO: 112 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 9 
ATGATGACHG AMATGATGAA AAC 



2) INFORMATION FOR SEQ ID NO: 1130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130 
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5 2 ) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 1132 



(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 2 0 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132 
AGTGAAAARA TGGCTGCTGC 20 

2) INFORMATION FOR SEQ ID NO: 1133 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133 
CATCAAGAAC ACTGGCTAYG TAG 23 

50 

2) INFORMATION FOR SEQ ID NO: 1134 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 
CTAGATAGAG CTAAAACCTT CCT 



2) INFORMATION FOR SEQ ID NO: 1135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 
CATTATGCAA ACGCCATTTC AAG 



2) INFORMATION FOR SEQ ID NO : 113 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TO POLOG Y : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136 
ACTTGTCCAC GTTSGATRTC T 



2) INFORMATION FOR SEQ ID NO: 1137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 



2) INFORMATION FOR SEQ ID NO: 1138 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1032 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 113 8 



ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT 50 

GTCACTAACC TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT 10 0 

15 ATGAAGTAAT GACCATTGGC ATCGCACCAA CAATGGATTG GTATTGGTAT 150 

CAAGGAAACC TCGCGAATGT TCGCAATGAT ACTTGGCTAG AAGATCACAA 20 0 

AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA TTAGGAGAAA 250 

AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 300 

GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT 350 

2 0 TGGTTGCCAT GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC 4 00 

ATCAACTTGC TGATACCATG GGAATCGCTA GTGCTCCCAC TTTGCTTTTA 450 

TCCCGCTATG AAAACGATCC TGCCACAATC GATCGTTTTA TTCAAGACCA 50 0 

TGGATTCCCG ATCTTTATCA AGCCGAATGA AGC CGGTTCT TCAAAAGGGA 550 

TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 600 

25 TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA 650 

AATTGGCTGC GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG 70 0 

ATGCGATTTC TCTTGTCGAC GGTTTTTTTG ATTTTGAAGA GAAATACCAA 750 

TTAATCAGCG CCACGATCAC TGTCCCAGCA CCATTGCCTC TCGCGCTTGA 800 

ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC TTGGGATTGA 850 

30 CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 900 

TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC 950 

AGCTATGATG GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT 100 0 

TGATTGCACT GGCAGAGGAG GACAAACGAT GA 1032 



2) INFORMATION FOR SEQ ID NO: 1139 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

50 (C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139 

GATATCGTTA CGCTTCATGT GCCGCTCAAT ACGGATACGC ACTATATTAT 50 

55 CAGCCACGAA CAAATACAGA GAATGAAGCA AGGAGCATTT CTTATCAATA 100 

CTGGGCGCGG TCCACTTGTA GATACCTATG AGTTGGTTAA AGCATTAGAA 150 

AACGGGAAAC TGGGCGGTGC CGCATTGGAT GTATTGGAAG GAGAGGAAGA 2 00 

GTTTTTCTAC TCTGATTGCA CCCAAAAACC AATTGATAAT CAATTTTTAC 250 

TTAAACTTCA AAGAATGCCT AACGTGATAA TCACACCGCA TACGGCCTAT 3 00 

60 TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 350 
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TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 4 00 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 450 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

5 TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 60 0 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATC CAT A 700 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 8 00 

LO ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG CTACGTTTAC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 950 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

L5 GTTAGTTGTT GGCGAGGTGG ACCAAATCAG GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGAC CTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 12 00 

GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 12 50 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 13 0 0 

20 CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 13 50 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1400 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 15 00 

AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

25 GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 1600 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAACTGTTTT 1650 

ATGCAATGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 1700 

TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 1768 

30 

2) INFORMATION FOR SEQ ID NO: 1140 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EntGrocaccus cassellflavus 
45 (C) ACCESSION NUMBER: L29638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140 

GTAAGAATCG GAAAAGCGGA AGGAAGAAAA ACATGAAAAA AATCGCCATT 50 

50 ATTTTTGGAG GCAATTCACC GGAATACACC GTTTCTTTAG CTTCAGCAAC 100 

TAGCGCAATC GAAGCACTCC AATCATCTCC CTATGACTAC GACCTCTCTT 150 

TGATCGGGAT CGCCCCAGAT GCTATGGATT GGTACTTGTA TACAGGAGAA 200 

CTGGAAAACA TCCGACAAGA CACGTGGTTG TTGGATACGA AACATAAACA 250 

GAAAATACAG CCGCTATTCG AAGGAAACGG CTTTTGGCTA AGTGAAGAGC 300 

55 AGCAAACGTT GGTACCTGAT GTTTTATTTC CCATTATGCA TGGCAAATAC 350 

GGGGAAGATG GCAGTATCCA AGGATTGTTT GAATTGATGA AGCTGCCTTA 400 

TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT ATGTATGAAC AAATGGCTGC 450 

TGCATCAAGC TGCAGCAGCC ATTGGCGTAC AAAGTGCTCC TACGATTCTC 500 

TTGACAAATC AAGCCAACCA GCAAGAACAA ATCGAAGCTT TTATCCAGAC 550 

60 CCATGGCTTC CCAGTTTTCT TTAAGCCTAA TGAAGCGGGC TCCTCAAAAG 600 
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GGATCACTAA AGTCACCTGC GTTGAAGAAA 
GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 
TGAGATCGGT TGCGGTATTT TGGGCAACGA 
GTGACGCCAT TTCATTAGTA GACGGCTTTT 
5 CAGCTGATCA GCGCCAAAAT CACCGTCCCT 
TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 
TTAAAGGTCT TGCTCGCATC GACTTTTTTG 
TACTTGAATG AAATCAATAC TATGCCGGGC 
TCCTGCCATG ATGGCAGCGG TCGGCTTATC 
LO AACTGCTTGT CTTAGCAAAG GAGGAAGTCA 



PCT/CA00/01150 

TCGCTTCTGC CTTAAAAGAA 650 

CAAAAAAATA TTGCCGGTGT 700 

CTCTTTGACT GTCGGTGCTT 750 

TCGATTTTGA AGAAAAGTAC 80 0 

GCGCCATTGC CTGAAACGAT 850 

GCTCTATCGT AGTCTTGGTC 900 

TCACGGAGCG AGGAGAACTA 950 

TTTACGAGTC ACTCCCGCTA 1000 

CTATCAAGAA CTACTACAAA 1050 

AATGAG 1086 



2) INFORMATION FOR SEQ ID NO: 1141 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
25 (B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M972 97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141 

30 ATGAATAACA TCGGCATTAC TGTTTATGGA TGTGAGCAGG ATGAGGCAGA 50 

TGCATTCCAT GCTCTTTCGC CTCGCTTTGG CGTTATGGCA ACGATAATTA 100 

ACGCCAACGT GTCGGAATCC AACGCCAAAT CCGCGCCTTT CAATCAATGT 150 

ATCAGTGTGG GACATAAATC AGAGATTTCC GCCTCTATTC TTCTTGCGCT 2 00 

GAAGAGAGCC GGTGTGAAAT ATATTTCTAC CCGAAGCATC GGCTGCAATC 2 50 

35 ATATAGATAC AACTGCTGCT AAGAGAATGG GCATCACTGT CGACAATGTG 3 00 

GCGTACTCGC CGGATAGCGT TGCCGATTAT ACTATGATGC TAATTCTTAT 350 

GGCAGTACGC AACGTAAAAT CGATTGTGCG CTCTGTGGAA AAACATGATT 40 0 

TCAGGTTGGA CAGCGACCGT GGCAAGGTAC TCAGCGACAT GACAGTTGGT 450 

GTGGTGGGAA CGGGCCAGAT AGGCAAAGCG GTTATTGAGC GGCTGCGAGG 50 0 

40 ATTTGGATGT AAAGTGTTGG CTTATAGTCG CAGCCGAAGT ATAGAGGTAA 55 0 

ACTATGTACC GTTTGATGAG TTGCTGCAAA ATAGCGATAT CGTTACGCTT 60 0 

CATGTGCCGC TCAATACGGA TACGCACTAT ATTATCAGCC ACGAACAAAT 650 

ACAGAGAATG AAGCAAGGAG CATTTCTTAT CAATACTGGG CGCGGTCCAC 700 

TTGTAGATAC CTATGAGTTG GTTAAAGCAT TAGAAAACGG GAAACTGGGC 750 

45 GGTGCCGCAT TGGATGTATT GGAAGGAGAG GAAGAGTTTT TCTACTCTGA 800 

TTGCACCCAA AAACCAATTG ATAATCAATT TTTACTTAAA CTTCAAAGAA 850 

TGCCTAACGT GATAATCACA CCGCATACGG CCTATTATAC CGAGCAAGCG 900 

TTGCGTGATA CCGTTGAAAA AACCATTAAA AACTGTTTGG ATTTTGAAAG ' 950 

GAGACAGGAG CATGAATAGA ATAAAAGTTG CAATACTGTT TGGGGGTTGC 10 00 

50 TCAGAGGAGC ATGACGTATC GGTAAAATCT GCAATAGAGA TAGCCGCTAA 1050 

CATTAATAAA GAAAAATACG AGCCGTTATA CATTGGAATT ACGAAATCTG 110 0 

GTGTATGGAA AATGTGCGAA AAACCTTGCG CGGAATGGGA AAACGACAAT 1150 

TGCTATTCAG CTGTACTCTC GCCGGATAAA AAAATGCACG GATTACTTGT 1200 

TAAAAAGAAC CATGAATATG AAATCAACCA TGTTGATGTA GCATTTTCAG 1250 

55 CTTTGCATGG CAAGTCAGGT GAAGATGGAT CCATACAAGG TCTGTTTGAA 13 0 0 

TTGTCCGGTA TCCCTTTTGT AGGCTGCGAT ATTCAAAGCT CAGCAATTTG 1350 

TATGGACAAA TCGTTGACAT ACATCGTTGC GAAAAATGCT GGGATAGCTA 1400 

CTCCCGCCTT TTGGGTTATT AATAAAGATG ATAGGCCGGT GGCAGCTACG 1450 

TTTACCTATC CTGTTTTTGT TAAGCCGGCG CGTTCAGGCT CATCCTTCGG 1500 

60 TGTGAAAAAA GTCAATAGCG CGGACGAATT GGACTACGCA ATTGAATCGG 1550 
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CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 
GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 
GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 
TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 
5 CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 
TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 
ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 
TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 
CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 

10 GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 
ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 
TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AAT CGCTTTT 
GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 
ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 

15 CAGCCGGAAA ATAACCTGAC AAAGGAAAGT TATTATCCCA ATATTGACCG 
AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 
GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 
GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 
GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 

2 0 CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 

TATGTATTAA GAG ACGAAC C ATACCCCAAT AGCTATTTTG ATTTCCCCGT 
TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 
CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 
TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 
25 GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 
GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 
AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 
TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 
TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 

3 0 ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 
GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA 
TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 
AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 

35 AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 
ATCTAAAGAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 
TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 
GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 
CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 

40 TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 
TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 
AAATGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 
AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 
CCACATAGTG CGATTATGAA AGAAAAGAAT TTCGTTCTCG AGGAATATAT 

45 GGATTACCTA AAAGAAGAAA AAACCATTTC TGTTAGTGTA AATGGGGAAA 
AATATGAGAT CTTTTATTAT CCTGTTACTA AAAATACCAC CATTCATGTG 
CCGACTAATC TTCGTTATGA GATATCAGGA AACAATATAG ACGGTGTAAT 
TGTGACAGTG TTTCCCGGAT CAACACATAC TAATTCAAGG AGGTAA 
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2) INFORMATION FOR SEQ 



ID NO: 1142 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



60 



(ii) 



MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142 
GATCCTCTAA ATGATTCTCA GGTGG 



2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1143 
CAATTAGCTT AGCAATAGGT GTTGG 



2) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144 
TGTYTTCCAA GGTTCAGCTC 



2) INFORMATION FOR SEQ ID NO: 1145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
AACATATTKG GTTGATAGGT 



2) INFORMATION FOR SEQ ID NO: 1146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146 
GGGATTACCT ATGCCAATAT GAT 23 



2) INFORMATION FOR SEQ ID NO: 1147 



(i) SEQUENCE CHARACTERISTICS: 
15. (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 
AGCTGTGTTA GCVCGAACAT CTTG 24 

25 

2) INFORMATION FOR SEQ ID NO: 1148 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
40 GACTTTGTTT GGCGTGATAT 2 0 



2) INFORMATION FOR SEQ ID NO: 1149 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149 

55 

TCCYACWATT TCTTTTTGWG 2 0 



60 2) INFORMATION FOR SEQ ID NO: 1150 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 
TGATAATCAC ACCGCATACG 



2 ) INFORMATION FOR SEQ ID NO: 1151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1151 
TGCTGTCATA TTGTCTTGCC 



2) INFORMATION FOR SEQ ID NO: 1152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152 
ATAAAGATGA TAGGCCGGTG 



2) INFORMATION FOR SEQ ID NO: 1153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 
CTCGTATGTC CCTACAATGC 
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2) INFORMATION FOR SEQ ID NO: 1154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 
GTTTGAAGCA TATAGCCTCG 



2) INFORMATION FOR SEQ ID NO: 1155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 
CAGTGCTTCA TTAACGTAGT C 



2 ) INFORMATION FOR SEQ ID NO: 1156 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 
GTTGAAATGC ATCACGAACA ATT 



2) INFORMATION FOR SEQ ID NO: 1157 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
AAGAACGTTT CAGTTAAGGA AAT 



2 ) INFORMATION FOR SEQ ID NO: 1158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 
AAGAGGTAAT GTCTGTGGT 



2) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159 
TGAAGGTTTG CCAGGTGA 



2) INFORMATION FOR SEQ ID NO: 1160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 



2) INFORMATION FOR SEQ ID NO: 1161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 
TCCAGGTGAT AACGTTGG 



2) INFORMATION FOR SEQ ID NO: 1162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1162 
CAAGTCCGTG GAAATGCA 



2) INFORMATION FOR SEQ ID NO: 1163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163 
GTTGGTTTCA ACGTTAAGAA C 



2) INFORMATION FOR SEQ ID NO: 1164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 
GGTTTCAACG TCAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 
GGTTTCAACG TGAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166 
ACGTTAAGAA TGTTTCTGTC AA 



2) INFORMATION FOR SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167 
GTTGGTTTCA ACGT 



2 ) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168 
GAACAATTGG TTGAAGGTGT 
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10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 1169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AP13 9883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169 



35 



ATGAACAAAC CAACGATTCT GCGCCTAATC 
CTTAAGCTTG GTTATCGCAG CCATTGTCTT 
ACTACGTTAG CAAGGCTCCT AGCCTATCCG 
ACTTCTAGTA AAATCTACGA CAATAAAAAT 
25 TTCTGAACGC CGCGTCAATG CCCAAGCTAA 
TTAAGGCAAT CGTTTCTATC GAAGACCATC 
ATTGATTCCA TCCGTATCCT GGGAGCTTTC 
TTCCCTCCAA GGTGGATCAA CTCTCACCCA 
ACTTTTCAAC CTCGACTTCC GACCAGACTA 
3 0 GCTTGGTTAG CGATTCAGTT AGAACAAAAA 
GACCTACTAT ATAAATAAGG TCTACATGTC 
AGACAGCAGC TCAAAACTAC TATGGTAAAG 
CCTCAGTTAG CCTTGCTGGC TGGAATGCCT 
CCCCTATTCA CATCCAGAAG CAGCCCAAGA 
CTGAAATGAA AAATCAAGGT TACATCTCTG 
GTCAATACAC CAATTACTGA TGGACTACAA 
TTACCCTGCT TACATGGATA ATTACCTCAA 
AACAAGAAAC TGGCTATAAC CTTCTAACTA 
AATGTAGACC AAGAAGCTCA AAAACATCTG 
TCAATACGTC TCTTACCCTG ACGATGATTT 
TAGATGTTTC AAATGGTAAA GTCATCGCCC 
GCAAGTAACG TTTCATTTGG TACCAACCAA 
CTGGGGTTCT GCTATGAAAC CAATCACCGA 
ACGGTGTTTA TGATTCCACT GCAACTATGG 
TATCCGGGAA CAAGCACACC TGTCTACAAC 
CTGCAATATG CTCTTCAACA 
GAATAAGGTC GGTCTAGATA 
GGTCTTGGTA TCGACTATCC AAGCATGCAT 
TAATACAACA GAATCTAATA AACAATACGG 
CTGCTGCTTA TGCTGCCTTT GCAAATGGTG 
TATATCCATA AAGTCGTCTT CAGTGATGGA 
TGTCGGAACT CGTGCCATGA AGGAAACGAC 
TGATGAAAAC AGTCTTGACT TATGGAACTG 
TGGCTTCCTC AAGCTGGTAA AACAGGAACC 
AGTTGAAAAC CACATCAAGA ACACTGGCTA 
TTGTTGGTTA TACTCGTAAG TATTCTATGG 
AATCGTTTAA CTCCTATCGT TGGAGATGGT 
TTATCGCTCA ATGATAACGT ATCTATCAGA 
GGACGATGCC AGACGGACTT TTCAGAAACG 
60 GGAGCTCGCC CAATATGGAC TGAACCCTCT 



45 



50 



55 



TAATATTACT 
TTGAGACTTT 



AAGTATCTGA 
AGGCGGAGGA 
AGAGTAAACT 
CAACTCATTG 
TGATATTCCC 
GCTTCTTCGA 
TTGCGCAATC 
ACAGTTGATT 
TTTCTCGTAA 
GCAACCAAAC 
TAATGGCAAC 
ACCTCAATAA 
CAGGCACCAA 
CCGCCGAAAC 
CTGAACAGTA 
AGTCTCAAAT 
GGAGGTCATC 
CTGGGATGGA 
TGGGATATCT 
GCAAGTCGCA 
AACTTGGAGC 
GCTGTGGAAA 
TTATGCACCT 
TTAATGATAT 
TGGGATAGAG 
ATCACGAAAT 
GAGCTAAAAC 
TATGCAAACG 
AGCAAGTAGT 
GCACTTACTA 
AGTAAAAAAG 
AGC CTATATG 
GGCGTGGAGC 
TCTAACTATA 
TGTAGCTCCA 
CTGTATGGAC 
TTCCTAGTTG 
AGATACTCAT 
GGGAATTTGT 
ACTCAACAAT 
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GCATTAGCTT 


50 


GTTTTTTTCT 


100 


AGTTGCAACA 


150 


C 1 GACTTGGG 


200 


ACAGATTTGG 


250 


C C ACAGGGGG 


300 


TGCAAAGTAA 


350 


AAGTTGACTT 


400 


GGCTCAGGAA 


450 


"A /™1 /"""l TV TV TV m/""tl 1 tl ■ I 

AGGAAATCTT 


500 


TATGGAATGC 


550 


fTii i if f i tv tv /^nsrn'mv 

I 1 I AAGTTTA 


600 


7V /^/**t\ 7v m "A m/"^ tv 

ACCAATATGA 


650 


TTGGTCTTAT 


700 


n~\f~~\ TV y**1 TV TV TV /~1 TV 

TGAGAAAGCA 


750 


CAGCAAGTAA 


800 


AATCAAGTAG 


850 


TGTTTACACA 


900 


ACAACTCCGA 


950 


TCTACGGTCG 


1000 


TCGTCACCAA 


1050 


CCAATCGTGA 


1100 


GCCATAGAAT 


1150 


TCCTTATAAC 


1200 


CATATTTCGG 


1250 


GTCACAGCCG 


1300 


CTTCCTTAAT 


1350 


CCATTTCAAG 


1400 


GAAAAAATGG 


1450 


TAAAC CAATG 


1500 


AGTTCTCTAA 


1550 


ATGACCGACA 


1600 


CTATCTTCCT 


1650 


CAGATGAGGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTCAAAAAT 


1950 


CCTCAACAGC 
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TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



TCTAGCTCAA 2050 
CAATAATACG 2100 
CTCAACCAGC 2150 
2160 



2) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 2 0 



2) INFORMATION FOR SEQ ID NO: 1171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1171 
ACGAGGATGA TTTGATTGTC 



2 ) INFORMATION FOR SEQ ID NO: 1172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE . TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: X13136 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172 

GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGAAAG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGCTATT CTCTAAATGA 200 
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CCGTGTAGGA ACCTCCTATT TGGAAAAGCA ATATGAAGAG ACCTTACAAG 250 

GAAAACGCTC GGTAAAAGAA ATCCATCTGG ACAAATATGG CAACATGGAA 3 00 

AGCGTGGATA CAATTGAGGA AGGTAGTAAG GGAAACAATA TCAAGCTGAC 350 

CATTGATTTG GCCTTCCAAG ATAGCGTGGA TGCTTTGCTG AAAAGTTATT 400 

5 TCAATTCAGA GTTGGGAAAT GGTGGAGCCA AGTATTCTGA AGGTGTCTAT 450 

GCAGTCGCCC TTAACCCAAA AACAGGTGCT GTTTTGTCTA TGTCAGGAAT 500 

TAAACATGAC TTGAAAACAG GAGAGTTGAC GCCGGATTCC TTGGGAACGG 550 

TAACCAATGT CTTTGTCCCA GGTTCGGTTG TCAAAGCAGC GACCATCAGC 6 00 

TCTGGTTGGG AAAATGGAGT CTTGTCAGGA AATCAGACCT TGACAGACCA 650 

LO GTC CATTGTC TTTCAAGGTT CAGCTCCAAT TAATTCTTGG TATCCTGCCT 700 

TTTCTAGACC AATGCCGATT ACGGCGGTTC AGGCTCTAGA GTATTCATCC 750 

AATGCTTATA TGGTCCAAAC AGCCCTAGGT CTTATGGGGC AGACCTATCA 8 00 

ACCCAATATG TTTGTCGGCA CCAGCAATCT AGAGTCTGCT ATGGGGAAAT 850 

TGCGTTCAAC CTTTGGTGAA TATGGTTTGG GTTCTGCGAC CGGAATTGAC 900 

15 CTACCAGATG AATCTACTGG ATTTGTTCCC AAAGAGTATA GCTTTGCTAA 950 

TTTCATTACC AATGCCTTTG GGCAGTTTGA TAACTATACG CCGATGCAGT 1000 

TGGCTCAGTA TGTAGCAACT ATTGCAAATG ATGGTGTTCG TGTGGCTCCT 1050 

CGTATTGTTG AAGGCATTTA TGGTAATAAT GATAAGGGAG GACTGGGTGA 1100 

CTTGATTCAG CAACTGCAAC CGACAGAGAT GAATAAGGTC AATATATCCG 1150 

20 ACTCCGATAT GAGCATCTTG CACCAAGGTT TTTATCAGGT TGCCCATGGT 1200 

ACTAGTGAAT TGACAACTGG ACGTGC CTTT TCAAATGGCG CCTTGGTATC 12 50 

CATTAGCGGA AAAACAGGTA CAGCCGAAAG CTATGTGGCA GATGGTCAGC 13 00 

AAGCAACCAA TACCAATGCG GTGGCCTATG CCCCATCTGA TAATCCCCAA 1350 

ATCGCTGTAG CTGTTGTCTT CCCTCATAAC ACCAACCTTA CAAATGGTGT 14 00 

25 CGGACCTTCC ATTGCGCGCG ATATTATCAA CCTCTATAAC CAACATCATC 1450 

CAATGAATTA GAAAGGAACA TATGCTTTAT CCAACACCTA TTGCTAAGCT 1500 

AATTGACAGT TATTCGAAGT TAC CGGGTAT CGGGATTAAG ACGGCTACCC 1550 

GTTTGGCCTT 1560 



30 



2) INFORMATION FOR SEQ ID NO: 1173 



(i) SEQUENCE CHARACTERISTICS:' 
3 5 (A) LENGTH: 2007 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

45 (C) ACCESSION NUMBER: 249096 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173 

GAGTCCCGAT TGCTGAGGAT GCAACCTCCT ATAATGTCTA TGCGGTCATT 50 

50 GATGAGAACT ATAAGTCAGC AACGGGTAAG ATTCTTTACG TAGAAAAAAC 100 

ACAATTTAAC AAGGTTGCAG AGGTCTTTCA TAAGTATCTG GACATGGAAG 150 

AATCCTATGT AAGAGAGCAA CTCTCGCAAC CTAATCTCAA GCAAGTTTCC 200 

TTTGGAGCAA AGGGAAATGG GATTACCTAT GCCAATATGA TGACTATCAA 2 50 

AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 3 00 

55 CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGTCTA 350 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTGC TGGGAACTTC 400 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGCATTA 450 

TTACCTATGA AAAGGATCGT CTGGGCAATA TTGTACCCGG AACAGAACTG 500 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 550 

60 TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 600 
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TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 650 

ATTCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 700 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 750 

AACCAGGATC AGCCTTTAAG GTCATGATGT TAGCTTCTTC TATTGATAAT 80 0 

5 AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 850 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAGGGTTTG ACTACTGGTG 900 

GGATGATGAC TTTCTCACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAACG 95 0 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 100 0 • 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 1050 

10 CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 1100 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 1150 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 120 0 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1250 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 13 0 0 

15 AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 13 50 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 14 00 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1450 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGC TGAAAAT CCTGATTTTA 1500 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1550 

2 0 GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 16 00 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1650 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 170 0 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1750 

AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 18 00 

25 AACAAGTTCT CCTTTTATCG GATAAGGTAG AAGAAATTCC AGACATGTAT 185 0 

AGCTGGAAAA AAGAGACTGC CGAGACCTTT GCTAAATGGT TGGATATTGA 1900 

ACTGGAATTT GAAGGTTCAG GTTCCGTTGT TCAGAAGCAA GATGTTCGGA 1950 

CTAATACAGC TATCAAAAAC ATTAAAAAAA TTAAATTAAC TTTAGGAGAC 2 00 0 

TAATATG 2007 

30 

2 ) INFORMATION FOR SEQ ID NO: 1174 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
4 5 GAACGTGGTG AAGTTCGC 18 



2) INFORMATION FOR SEQ ID NO: 1175 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 



60 
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5 2) INFORMATION FOR SEQ ID NO: 1176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
10 (C) STRAND EDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176 

TACTGGTGTA GAAATGTTC 19 



20 



2) INFORMATION FOR SEQ ID NO: 1177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
25. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



30 



35 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177 
GCTCAACAAG TTCCAGATTA 20 

2) INFORMATION FOR SEQ ID NO: 1178 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC8325 

50 (C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178 

ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

55 TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT ACTACATTTG 10 0 

TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATGAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 2 00 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAATACTA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 300 

60 AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 

644 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AATAGTTTAG GCGTTAAAGA TATAAACATT 
AGTATCTAAA AATAAAAAAC GAGTAGATGC 
ACTACGGTAA CATTGATCGC AACGTTCAAT 
GGTATGTGGA AGTTAGATTG GGATCATAGC 
GAAAGACCAA AGCATACATA TTGAAAATTT 
TTTTAGACCG AAACAATGTG GAATTGGCCA 
TTAGGCATCG TTCCAAAGAA TGTATCTAAA 
TAAAGAACTA AGTATTTCTG AAGACTATAT 
TTGGGTACAA GATGATACCT TCGTTCCACT 
GATGAATATT TAAGTGATTT CGCAAAAAAA 
AACAGAAAGT CGTAACTATC CTCTAGAAAA 
GTTATGTTGG TCCCATTAAC TCTGAAGAAT 
GGCTATAAAG ATGATGCAGT TATTGGTAAA 
CGATAAAAAG CTCCAACATG AAGATGGCTA 
ATAATAGCAA TACAATCGCA CATACATTAA 
GGCAAAGATA TTCAACTAAC TATTGATGCT 
TAACAACATG AAAAATGATT ATGGCTCAGG 
CAGGTGAATT ATTAGCACTT GTAAGCACAC 
TTTATGTATG GCATGAGTAA CGAAGAATAT 
AAAAGAACCT CTGCTCAACA AGTTCCAGAT 
CTCAAAAAAT ATTAACAGCA ATGATTGGGT 
GATAAAACAA GTTATAAAAT CGATGGTAAA 
TTGGGGTGGT TACAACGTTA CAAGATATGA 
ACTTAAAACA AGCAATAGAA TCATCAGATA 
GCACTCGAAT TAGGCAGTAA GAAATTTGAA 
TGTTGGTGAA GATATACCAA GTGATTATCC 
CAAACAAAAA TTTAGATAAT GAAATATTAT 
CAAGGTGAAA TACTGATTAA CCCAGTACAG 
ATTAGAAAAT AATGGCAATA TTAACGCACC 
AAAACAAAGT TTGGAAGAAA AATATTATTT 
TTAAATGATG GTATGCAACA AGTCGTAAAT 
TTATAGATCT TATGCAA&CT TAATTGGCAA 
AAATGAAACA AGGAGAAAGT GGCAGACAAA 
GATAAAGATA ATCCAAACAT GATGATGGCT 
AGATAAAGGA ATGGCTAGCT ACAATGCCAA 
ATGAGCTATA TGAGAACGGT AATAAAAAAT 
AACAGTGAAG CAATCCGTAA CGATGGTTGC 
TATTAATAAG TGCTGTTACT TCTCCCTTAA 
TTGTATGTTG AAAGTGACAC TGTAACGAGT 
TTTCTTATTT GTAATTTCAG CGATAACGTA 
AGTTTAATAA ATTTAACGTT ATTCATTTGT 
TCCGTATTTA CCTTCTTCTA CCCATAATTT 
GCATGC 
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CAGGATCGTA 


AAATAAAAAA 
.rlxxxx X xxxxxxxxxxxx 


A O A 

4UU 


TCAATATAAA 

x ruin 


ATTA A A A OA A 
xi X IxuiHriV-xiri 




TTAATTTTPT 


T A A A CI A A CI A T 
x xAxxxxAjxxixVJXx x 


c a a 


GTCATTATT P 

VJ X wll XXI X X v^. 


PAPfZA ITHP A 
v^xxovjxxxx X bLH 


c: c n 


AAAATCAGAA 


PPTPPT A AAA 


^ a a 


ATACAGGAAP 


A P A T A TP A CI A 
xxv—xa lxil 




AAAGATTATA 


A APP A ATPPP 


*7 a a 


CAACA APAA A 


TPP A TP A AAA 
J- VJOXX x V^xxxxxxxx 


/DU 


TTAAAAPPPT 


T A A A A A A A TP 
Xxxxxxxxixxxxxx X vj 


Q A n 

o U U 


XXX v— XX X \_ X XXI 


PA A PT A A TP A 

UttXiw X XXXX X OXX 


OCA 


AGCGAPTTPA 


P A TPT A TT A CZ 


Q A A 


TAAAAPAAAA 


APA ATATA A A 
xxvjxxxx X xx X xxxxxx. 




AAGGGAPTPP 


A A A A A fTTTfl 
xxxxxxxxxxV^. X X Xxx 


T AAA 

IUUU 


TPGTGTPAPA 


xi- X WSJ X X VJXXV-.V3 


1UDU 


TAG AG A A A A A 

x xivjn vjxxxxxxxxxx 


PA A A A A Af^AT 

VJX^lXXXXXXXXXXVJXX J. 


T T A A 


AAAGTTP AA A 

XXXXXXVJ X X V-.XXXXXX 


A P A PT A TT T A 
rioHo X xx X X X xi 


n T en 


TAPTGPTATP 

XAU X wVw XXI X u 


PAPPPTPA A A 
v— xi\_ X v„»xxxxxx 


T O A A 


PT TCA TATPA 

\— x x v^-xi x xi. x vjri 


PHTPT21 TPP A 
wO xi X wLxi 


1 O C A 


A ATA A ATTA A 

XXXX X XXXXXX X X XXXX 


PPP AAPA T A A 
wwUXiflUfi X XiXx 


T O A A 


T A P A A PTTP IV 
x xxv*xxxx\». x x uri 


PPAPPTTPA A 
w wxAvjVj 1 1 L-XXXX 


T O C A 


TA A ATA APAA 

IXiririiXirt^-rJxl 


A APATTAP A P 
x*AwAl XxAtjXXA-. 


T yi A A 




A APATA A A TP 
xxxxLxtx x xAxxxx x 




AGTGGTAAAT 


PPT A A T A TPP 

wU X Xlfl X XX X I— VJ 


n c;aa 


A P A TTTTPTT 

rrwXl X X X X Vw X X 


TPPTAPA PTA 
A Ow X xxVjxxLj Xxx 


T tZ C A 


AAAGGPATGA 

XiXl/lwwwXl. X WXl 


A A A A APT Ann 
^xxxxxxxxv— XxxVjLs 


i con 


ATTTTATA AT 

ill x x iriiruil 


PPTPL A ATTT 
ov — L ^xXXxxxX X X 


IDjU 


TAPPTPATTP 

X XiVJL. X VJXX XIL 


APPiTTA ppp a 
xxvjrVjx XxxL.<wJOxx 


i -7 A r\ 
1 / UU 


ATPPTTTPA A 


TPTATAPPPP 


1 TCA 
1 / DU 


TPAPTTATT A 

X LrlL. X xrii JL/i 


A A APAP A PPA 

XXXXXXAjXX V-xxL. ^3TXX 


T Q A A 


PPA AAPA A A2V 
V-»xxxxxx\jxxxxxxx'X 


TATPA 7\ TfT7\ 
1 xx 1 L-xxxx. ± xxx 


"1 Q C A 


A A A A HAP AT A 
Xxxx/xxx. V^XXV-.xx. X XX 


A A PL A SP7VTST 
xxxx^xxxxLjxx X XX X 


T Q A A 

x you 


ATPPPPTAPT 
Xi X v^uvjulnv* X 


PP A p7v A PTP A 
wLAbH/iL x Cxx 


1 QCA 

x you 


ttpppthiptt 

X X X wU X X 


TA T ATP A TA T 
xxx,XxxX VJxxXxxX 


A A A 


A TT A A TP TT A 


A A Hi A TPT A P A 

xXXxkJXX X VJ X XxV^XX 


O A C A 


A ATPTPAPPT 

XxXx X V— X ^XxVjvj X 


AAA PTPT A TP 
XXXXXXw X o xxxl or 


O T A A 


APGATATAPA 

riv^vjrl X XI X XXVJXX 


TP A A TA A PA A 
X VaxAxx XxxxxL^xxXX 




TTCAPTGTTT 

x x wn\^ x \j xxx 


TATT A TP A A T 

X XX X XXXXVJXXXXX 


zzuu 


ATACAATTTP 

riJLnwuii x x v 


TTPATTTTPA 

X X V — XX X X X X LXi 




CCATTTTCTT 


TTTTTATGGA 


2300 


CAATGTATTA 


CCTGGTATAC 


2350 


GTTCCTGCTA 


CAACTTCTTC 


2400 


AAATGATATT 


GAAAGTGTAT 


2450 






2456 



2) INFORMATION FOR SEQ ID NO: 1179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179 
ATTTGGTGAC GGGTGACTTT 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 1180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
15 TCCACCGTTG CCAATCGCA 19 



2) INFORMATION FOR SEQ ID NO: 1181 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181 

30 

AGCAGCTTAC TAGATGCCGT 20 



35 2) INFORMATION FOR SEQ ID NO: 1182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 

AACTGCAAGA GATCCTTTGG 20 



50 

2 ) INFORMATION FOR SEQ ID NO: 1183 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2535 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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WO 01/23604 PCT/CA00/01150 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 

5 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1183 

ATGGCGATAG AAAAGCTATC ACCCGGCATG CAACAGTATG TGGATATTAA 50 

10 AAAGCAATAT CCAGATGCTT TTTTGCTCTT TCGGATGGGT GATTTTTATG 100 

AATTATTTTA TGAGGATGCG GTCAATGCTG CGCAGATTCT GGAAATTTCC 150 

TTAACGAGTC GCAACAAGAA TGCCGACAAT CCGATCCCTA TGGCGGGTGT 200 

TCCCTATCAT TCTGCCCAAC AGTATATCGA TGTCTTGATT GAGCAGGGTT 250 

ATAAGGTGGC TATCGCAGAG CAGATGGAAG ATCCTAAACA AGCAGTTGGG 300 

15 GTTGTTAAAC GAGAGGTTGT TCAGGTCATT ACGCCAGGGA CAGTGGTCGA 350 

TAGCAGTAAG CCGGACAGTC AGAATAATTT TTTGGTTTCC ATAGACCGCG 400 

AAGGCAATCA ATTTGGCCTA GCTTATATGG ATTTGGTGAC GGGTGACTTT 450 

TATGTGACAG GTCTTTTGGA TTTCACGCTG GTTTGTGGGG AAATCCGTAA 500 

CCTCAAGGCT CGAGAAGTGG TGTTGGGTTA TGACTTGTCT GAGGAAGAAG 550 

2 0 AACAAATCCT CAGCCGCCAG ATGAATCTGG TACTCTCTTA TGAAAAAGAA 600 

AGCTTTGAAG ACCTTCATTT ATTGGATTTG CGATTGGCAA CGGTGGAGCA 650 

AACGGCATCT AGTAAGCTGC TCCAGTATGT TCATCGGACT CAGATGAGGG 700 

AATTGAACCA CCTCAAACCT GTTATCCGCT ACGAAATTAA GGATTTCTTG 750 

CAGATGGATT ATGCGACCAA GGCTAGTCTG GATTTGGTTG AGAATGCTCG 800 

2 5 CTCAGGTAAG AAACAAGGCA GTCTTTTCTG GCTTTTGGAT GAAACCAAAA 850 

CGGCTATGGG GATGCGTCTC TTGCGTTCTT GGATTCATCG CCCCTTGATT 900 

GATAAGGAAC GAATCGTCCA ACGTCAAGAA GTAGTGCAGG TCTTTCTCGA 950 

CCATTTCTTT GAGCGTAGTG ACTTGACAGA CAGTCTCAAG GGTGTTTATG 1000 

ACATTGAGCG CTTGGCTAGT CGTGTTTCTT TTGGCAAAAC CAATCCAAAG 1050 

30 GATCTCTTGC AGTTGGCGAC TACCTTGTCT AGTGTGCCAC GGATTCGTGC 1100 

GATTTTAGAA GGGATGGAGC AACCTACTCT AGCCTATCTC ATCGCACAAC 1150 

TGGATGCAAT CCCTGAGTTG GAGAGTTTGA TTAGCGCAGC GATTGCTCCT 12 00 

GAAGCTCCTC ATGTGATTAC AGATGGGGGA ATTATCCGGA CTGGATTTGA 12 50 

TGAGACTTTA GACAAGTATC GTTGCGTTCT CAGAGAAGGG ACTAGCTGGA 13 00 

35 TTGCTGAGAT TGAGGCTAAG GAGCGAGAAA ACTCTGGTAT CAGCACGCTC 1350 

AAGATTGACT ACAATAAAAA GGATGGCTAC TATTTTCATG TGACCAATTC 14 00 

GCAACTGGGA AATGTGCCAG CCCACTTTTT CCGCAAGGCG ACGCTGAAAA 1450 

ACTCAGAACG CTTTGGAACC GAAGAATTAG CCCGTATCGA GGGAGATATG 1500 

CTTGAGGCGC GTGAGAAGTC AGCCAACCTC GAATACGAAA TATTTATGCG 1550 

40 CATTCGTGAA GAGGTCGGCA AGTACATCCA GCGTTTACAA GCTCTAGCCC 1600 

AAGGAATTGC GACGGTTGAT GTCTTACAGA GTCTGGCGGT TGTGGCTGAA 1650 

ACCCAGCATT TGATTCGACC TGAGTTTGGT GACGATTCAC AAATTGATAT 1700 

CCGGAAAGGG CGCCATGCTG TCGTTGAAAA GGTTATGGGG GCTCAGACCT 1750 

ATATTCCAAA TACGATTCAG ATGGCAGAAG ATACCAGTAT TCAATTGGTT 1800 

45 ACAGGGCCAA ACATGAGTGG GAAGTCTACC TATATGCGTC AGTTAGCCAT 1850 

GACGGCGGTT ATGGCCCAGC TGGGTTCCTA TGTTCCTGCT GAAAGCGCCC 1900 

ATTTACCGAT TTTTGATGCG ATTTTTACCC GTATCGGAGC AGCAGATGAC 1950 

TTGGTTTCGG GTCAGTCAAC CTTTATGGTG GAGATGATGG AGGCCAATAA 2000 

TGCCATTTCG CATGCGACCA AGAACTCTCT CATTCTCTTT GATGAATTGG 2050 

50 GACGTGGAAC TGCAACTTAT GACGGGATGG CTCTTGCTCA GTCCATCATC 2100 

GAATATATCC ATGAGCACAT CGGAGCTAAG ACCCTCTTTG CGACCCACTA 2150 

CCATGAGTTG ACTAGTCTGG AGTCTAGTTT ACAACACTTG GTCAATGTCC 22 00 

ACGTGGCAAC TTTGGAGCAG GATGGGCAGG TCACCTTCCT TCACAAGATT 2250 

GAACCGGGAC CAGCTGATAA ATCCTACGGT ATCCATGTTG CCAAGATTGC 23 00 

55 TGGCTTGCCA GCAGACCTTT TAGCAAGGGC GGATAAGATT TTGACTCAGC 2350 

TAGAGAATCA AGGAACAGAG AGTCCTCCTC CCATGAGACA AACTAGTGCT 2400 

GTCACTGAAC AGATTTCACT CTTTGATAGG GCAGAAGAGC ATCCTATCCT 2450 

AGCAGAATTA GCTAAACTGG ATGTGTATAA. TATGACACCT ATGCAGGTTA 2500 

TGAATGTCTT AGTAGAGTTA AAACAGAAAC TATAA 2535 



60 
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2) INFORMATION FOR SEQ ID NO: 1184 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 



15 



TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

20 GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 15 0 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 25 0 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 

25 GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 45 0 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 55 0 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 60 0 

30 AAACCAATCC AAAGGATCTC TTG 623 



35 



2) INFORMATION FOR SEQ ID NO: 1185 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

50 TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 10 0 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

55 GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 4 50 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

60 CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 
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CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 60 0 

AAACCAATCC AAAGGATCTC T 621 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO : 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

( xv ) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 



TGACGGGTGA 
GGGGAAATCC 
GTCTGAGGAA 

25 CTTATGAAAA 
GCAACGGTGG 
GACTCAGATG 
TTAAGGATTT 
GTTGAGAATG 

30 GGATGAAACC 
ATCGCCCCTT 
CAGGTCTTTC 
CAAGGGTGTT 
AAACCAATCC 

35 



CTTTTATGTG 
GTAACCTCAA 
GAAGAACAAA 
AGAAAGCTTT 
AGCAAACGGC 
AGGGAATTGA 
CTTGCAGATG 
CTCGCTCAGG 
AAAACGGCTA 
GATTGATAAG 
TCGACCATTT 
TATGACATTG 
AAAGGATCTC 



ACAGGTCTTT 
GGCTCGAGAA 
TCCTCAGCCG 
GAAGACCTTC 
ATCTAGTAAG 
ACCACCTCAA 
GATTATGCGA 
TAAGAAACAA 
TGGGGATGCG 
GAACGAATCG 
CTTTGAGCGT 
AGCGCTTGGC 
TT 



TGGATTTCAC 
GTGGTGTTGG 
CCAGATGAAT 
ATTTATTGGA 
CTGCTCCAGT 
ACCTGTTATC 
CCAAGGCTAG 
GGCAGTCTTT 
TCTCTTGCGT 
TCCAACGTCA 
AGTGACTTGA 
TAGTCGTGTT 



GCTGGTTTGT 5 0 

GTTATGACTT 100 

CTGGTACTCT 150 

TTTGCGATTG 2 00 

ATGTTCATCG 250 

CGCTACGAAA 30 0 

TCTGGATTTG 3 50 

TCTGGCTTTT 4 00 

TCTTGGATTC 450 

AGAAGTAGTG 500 

CAGACAGTCT 550 

TCTTTTGGCA 600 

622 



2) INFORMATION FOR SEQ ID NO: 1187 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-55 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1187 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

55 GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 3 00 

60 TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA 
GGATGAAACC AAAACGGCTA TGGGGATGCG 
ATCGCCCCTT GATTGATAAG GAACGAATCG 
CAGGTCTTTC TCGACCATTT CTTTGAGCGT 
5 CAAGGGTGTT TATGACATTG AGCGCTTGGC 
AAACCAATCC AAAGGATCTC TT 
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GGCAGTCTTT TCTGGCTTTT 4 00 

TCTCTTGCGT TCTTGGATTC 45 0 

TCCAACGTCA AGAAGTAGTG 500 
AGTGACTTGA CAGACAGTCT 550 
TAGTCGTGTT TCTTTTGGCA 60 0 

622 



10 2) INFORMATION FOR SEQ ID NO: 1188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



25 



40 



50 



(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 



GGGTGACTTT TATGTAACGG GGCTATTGGA TTTCACGTTG GTTTGTGGGG 50 

AAATTCGCAA TCTCAAGGCT AGAGAAGTGG TGCTGGGTTA TGACTTGTCT 100 

GAGGAAGAAG AACAAATCCT CAGTCGTCAG ATGAATCTGG TGCTTTCTTA 15 0 

TGAGAAGGAA GGCTTTGAGG ACCTTCATTT ACTGGATCCA CGACTGGCAG 2 00 

30 CTGTGGAGCA AGCGGCAGCT AGTAAGCTCC TCCAGTATGT TCACCGGACC 2 50 

CAGATGCGGG AATTGAACCA CCTCAAACCA GTTATCCGCT ATGAAATCAA 3 00 

AGATTTCTTA CAGATGGACT ATGCGACCAA GGCTAGTCTG GATTTGGTTG 3 50 

AGAATGCCCG TTCAGGCAAG AAGCAAGGCA GTCTTTTCTG GCTTTTAGAT 4 00 

GAAACCAAGA CGGCTATGGG AATGCGTCTC TTGCGTTCTT GGATTCATCG 450 

35 TCCTTTGATT GATAAGGAGC GAATCGTCCA GCGTCAAGAG GTGGTGCAGG 500 

TCTTTCTTGA CCACTTCTTT GAGCGTAGTG ATTTAACGGA CAGTCTTAAG 550 

GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 5 99 



2) INFORMATION FOR SEQ ID NO: 1189 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

55 (x) SEQUENCE DESCRIPTION: SEQ ID NO: 1189 

GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

GTGGGGAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

60 TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 2 00 
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TGGCAGCTGT GGAGCAAGCG GCATCTAGTA AACTGCTTCA GTATGTTCAT 250 

CGGACTCAGA TGAGGGAATT GAACCACCTC AAGCCTGTTA TCCGCTATGA 3 00 

AATCAAAGAT TTTTTGCAGA TGGATTATGC GACCAAGGCT AGTCTGGATT 3 50 

TGGTTGAGAA TGCCCGTTCA GGCAAGAAGC AAGGTAGTCT TTTTTGGCTT 4 00 

TTGGATGAAA CCAAAACAGC TATGGGAATG CGTCTCTTGC GGTCTTGGAT 450 

TCATCGCCCC CTGATTGATA AGGAACGAAT TGTCCAACGC CAAGAAGTTG 500 

TGCAGGTCTT TCTCGACCAT TTCTTTGAGC GTAGTGATTT GACAGACAGT 550 

CTCAAGGGTG TTTATGACAT TGAGCGCTTG GCTAGTCGTG TTTCTTTTGG 600 

CAAAACCAAT CCAAAGGATC TCTT 624 



2) INFORMATION FOR SEQ ID NO: 1190 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
25 (B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGT CTTGGTCTGC 50 

30 GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGCTGCTGGG CGAGGAGCTG 200 

TCTCCTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 3 00 

35' TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACT AAGACGGCCA TGGGCGGCCG CATGCTGCGC TCTTGGATCC 450 

AGCGTCCGCT GATTGATGAA GCGCGAATTA GCCAGCGACA GAATGTCGTT 500 

GAGGTTTTTC TGGATCATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

40 CAAGGGGGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC 599 



2) INFORMATION FOR SEQ ID NO: 1191 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 

60 TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 
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GGGGAAATCC GTAATTTGAA GGCTAGGGAA GTGGTGCTGG GCTATGCTTT 100 

GCCAGAAGCT GAGGAGCAGG TCTTGGCTGG ACAGATGAAT CTTTTGCTGT 150 

CCTATGTACA GACGGCCTTG GACGATGTCC AGCTGCTGGG CGAGGAACTG 200 

TCTCCTATGG AGCGTCAGGC AGCGGGGAAA TTGCTAGAGT ATGTGCACCG 250 

5 GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCCCAG CATTATGAAA 3 00 

TCAAGGACTT TCTGCAAATG GATTATGCTA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG TAAGAAACAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACC AAGACGGCCA TGGGCGGCCG TATGCTGCGC TCTTGGATCC 450 

AGCGTCCGTT GATTGATGAA GTGCGAATTA GCCAGCGGCA GAATGTCGTC 500 

10 GAGGTTTTTC TGGAACATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

CAAGGGAGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGCA 600 

AGACCAATCC AAAGGATCTC TT 622 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1192 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192 
GGTAAAACAG GAACCTCTAA CT 22 

2) INFORMATION FOR SEQ ID NO: 1193 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 
GGTAAGACAG GTACTTCTAA CT 22 

45 



2) INFORMATION FOR SEQ ID NO: 1194 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 
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PCT/CAOO/01150 

24 



5 2 ) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1195 

CATTTCAAGT AACACAACTG AATC 24 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 1196 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196 
GCCATTTCAA GTAATACAAC AGAA 24 

2) INFORMATION FOR SEQ ID NO: 1197 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 25 

50 

2) INFORMATION FOR SEQ ID NO: 1198 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 



AACGGGCGTC 
ATCAATATAC 
CAAATAAAAA 
GATTCTGGTT 
TTTAAAGTAT 
ATCATGAAAC 
TACAACAACG 
AACATCCTAT 



TCGATAGAAA 
GATTTTGCTG 
TTAATAGGCA 
TTTTAAATTC 
AGGTATGAAA 
AATTAACTAA 
CTGATTGGCG 
TGATTTTGTG 



AACACGTGAA 
AAGCTGTAAC 
ACTTAACCAG 
AAAAAGTTTT 
TACAATTGAT 
GCCTTTATAC 
CGTTACTATT 
GACGCCCGTT 



AATCCCAATG 
AAAAGGTATT 
AATCGTTAAA 
CTAAAAAATT 
TAAAATAGTA 
TTTTACCTAT 
ATATTTGCCA 
A 



ATTATAAACA 50 

AAGGAACAAA 100 

ACTATATGAC 150 

TACTTGCTTC 2 00 

AAGGAAATGA 250 

TACTTTTTAT 3 00 

ATCACAKGTA 350 

381 



2) INFORMATION FOR SEQ ID NO: 1199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATCCAA AAAGC 



2) INFORMATION FOR SEQ ID NO: 1200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 00 
TCAAAGAAGA AACTAAAAAA GCTGT 



2) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AACGTAGGTG TCCTTCTTC 



2) INFORMATION FOR SEQ ID NO: 1202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2) INFORMATION FOR SEQ ID NO: 12 03 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 03 
GGIGARMGIG GIAAYGARAT G 



2 ) INFORMATION FOR SEQ ID NO: 1204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12 04 
GC I AAYAAC I TC I WMYATGC C 



2 ) INFORMATION FOR SEQ ID NO: 12 05 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 
AAYACITCIA WYATGCC I GT 20 



2 ) INFORMATION FOR SEQ ID NO: 1206 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206 
CKISRIGTIG ARTCIGCCA 19 

25 

2) INFORMATION FOR SEQ ID NO: 1207 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207 
4 0 CCITCITCWC C I GGCATYTC 2 0 



2 ) INFORMATION FOR SEQ ID NO: 1208 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208 

55 

TCAAAAAGTT TTCTAAAAAA TTTAC 25 



60 2) INFORMATION FOR SEQ ID NO: 1209 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209 
ACGGGCGTCC ACAAAATCAA TAGGA 



2) INFORMATION FOR SEQ ID NO: 1210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
ACCAGCTTGC CCAATACAAA GG 



2 ) INFORMATION FOR SEQ ID NO: 1211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 



2) INFORMATION FOR SEQ ID NO: 1212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 
CCICCIRGIG GIGAIACIGC WCC 

657 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2 ) INFORMATION FOR SEQ ID NO: 1213 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 

15 

AARGGIGGIA CIGCIGCIAT HCCIGG 26 



20 2) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 

GGTAAAACAG GTACCTCTAA CTA 23 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 1215 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1337 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1215 



AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

55 ATGTCATTAG AACGTTGCGT CAAACTAGCA TCTAAATAAA GATCGAAATG 150 

CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 2 00 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTGCTAAGTT 250 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 3 00 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 350 

60 ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 4 00 
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AGTTCCAACG GTTTTTCAAA AATCTGCCGA TGTGCTCCAG CAGCATATCG 45 0 

AAAGCTTTCA ACCTGATGCA GTCCTTTGTA TTGGGCAAGC TGGTGGCCGG 500 

ACTGGACTAA CGCCAGAACG CGTTGCCATT AATCAAGACG ATGCTCGCAT 550 

TCCTGATAAC GAAGGGAATC AGCCTATTGA TACACCTATT CGTGCAGATG 60 0 

5 GTAAAGCAGC TTATTTTTCA ACCTTGCCAA TCAAAGCGAT GGTTGCTGCC 650 

ATTCATCAGG CTGGGCTTCC TGCTTCTGTT TCTAATACAG CTGGTAC CTT 700 

TGTTTGCAAT CATTTGATGT ATCAAGCCCT TTACTTAGTG GATAAATATT 750 

GTCCAAATGC CAAAGCTGGG TTTATGCATA TTCCCTTTAT GATGGAACAG 800 

GTTGTTGATA AACCTAATAC AGCTGCCATG AACCTCGATG ATATTACAAG 850 

10 AGGAATTGAG GCTGCTATTT TTGCCATTGT CGATTTCAAA GATCGTTCCG 900 

ATTTAAAACG TGTAGGGGGC GCTACTCACT GACTGTGACG CTACTAAACC 950 

TATTTTAAAA AAACAGAGAT ATGAACTAAC TCTGTTTTTT TTGTGCTAAA 1000 

AATGAAAGAC CTAGGGAAAC TTTTCATCGG TCTTTCTCAA TTGTCATCTT 1050 

AATCTAATAC TACTTCTAAC ATCAGCGGGT ATAGTTTGCC AGTAATTAAG 1100 

15 AAACGTTGTT GATCTAAATG AGCAATCCCA TTCAAAACAT TAAGGTCAGG 1150 

GTAATGGGAC TTATCAAGAT TTAAGGCTTT TAACAAAGGA CTAATATCAT 1200 

AGGTGGCTAC CACCTTTCCA GAATCAGGTT GGAGTTTGAC AATAGTATTG 1250 

GTTTGCCAAA TATTGGCATA GAGATAACCA TCTACATACT CTAATTCGTT 1300 

AAGCATTGAG ATAGGGACAC TTTCTATAGC AACTAGT 1337 

20 

2) INFORMATION FOR SEQ ID NO: 1216 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
35 GGTAAGACTG GTACATCAAA CTA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1217 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1217 

50 

CAAATGCCAT TTCAAGTAAC ACAAC 25 



55 2 ) INFORMATION FOR SEQ ID NO: 1218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 
CAAACGCCAT TTCAAGTAAC ACAAC 



2) INFORMATION FOR SEQ ID NO: 1219 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 
CAAATGCTAT TTCAAGTAAT ACAAC 



2) INFORMATION FOR SEQ ID NO: 122 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 0 
CAAACGCCAT TTCAAGTAAT ACGAC 



2) INFORMATION FOR SEQ ID NO: 1221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 
GAYACICCIG GICAYGTIGA YTT 



2) INFORMATION FOR SEQ ID NO: 1222 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
ATYGAYACIC CIGGICAYGT IGAYTT 



2) INFORMATION FOR SEQ ID NO: 1223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223 
AYITCIARRT GIARYTCRCC CATICC 



2) INFORMATION FOR SEQ ID NO: 1224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT IATG 



2) INFORMATION FOR SEQ ID NO: 1225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225 
TAI CCRAACA TYTCISMIAR IGGIAC 
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2) INFORMATION FOR SEQ ID NO: 1226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 6 
GTIRMRTAIC CRAACATYTC 



2) INFORMATION FOR SEQ ID NO: 1227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227 
GTICCIYTIK C I GARATGTT YGGITA 



2) INFORMATION FOR SEQ ID NO: 1228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 
GTICCIYTIK C I GARATGTT YGGITAYGC 



2) INFORMATION FOR SEQ ID NO: 1229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ . ID NO: 1229 
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TCCATYTGIG CIGCICCIGT IATCAT 



26 



5 2) INFORMATION FOR SEQ ID NO: 1230 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2145 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 

(D) TOPOLOGY: Linear 



15 



20 



25 



30 



35 



40 
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50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(D) ACCESSION NUMBER: X00415 

(xiii) SEQUENCE -DESCRIPTION: SEQ ID NO: 1230 

TGAACGCCTA AAAGATAAAC GAGGAAACAA ATGGCTCGTA CAACACCCAT 50 

CGCACGCTAC CGTAACATCG" GTATCAGTGC GCACATCGAC GCCGGTAAAA 100 

CCACTACTAC CGAACGTATT CTGTTCTACA CCGGTGTAAA CCATAAAATC 150 

GGTGAAGTTC ATGACGGCGC TGCAAC CATG GACTGGATGG AGCAGGAGCA 2 00 

GGAACGTGGT ATTACCATCA CTTCCGCTGC GACTACTGCA TTCTGGTCTG 250 

GTATGGCTAA GCAGTATGAG CCGCATCGCA TCAACATCAT CGACACCCCG 3 00 

GGGCACGTTG ACTTCACAAT CGAAGTAGAA CGTTCCATGC GTGTTCTCGA 350 

TGGTGCGGTA ATGGTTTACT GCGCAGTTGG TGGTGTTCAG CCGCAGTCTG 4 00 

AAACCGTATG GCGTCAGGCA AACAAATATA AAGTTCCGCG CATTGCGTTC 450 

GTTAACAAAA TGGACCGCAT GGGTGCGAAC TTCCTGAAAG TTGTTAACCA 500 

GATCAAAACC CGTCTGGGCG CGAACCCGGT TCCGCTGCAG CTGGCGATTG 550 

GTGCTGAAGA ACATTTCACC GGTGTTGTTG AC CTGGTGAA AATGAAAGCT 600 

ATCAACTGGA ACGACGCTGA CCAGGGCGTA ACCTTCGAAT ACGAAGATAT 650 

CCCGGCAGAC ATGGTTGAAC TGGCTAACGA ATGGCACCAG AACCTGATCG 700 

AATCCGCAGC TGAAGCTTCT GAAGAGCTGA TGGAAAAATA CCTGGGTGGT 750 

GAAGAACTGA CTGAAGCAGA AATCAAAGGT GCTCTGCGTC AGCGCGTTCT 800 

GAACAACGAA ATCATCCTGG TAACCTGTGG TTCTGCGTTC AAGAACAAAG 850 

GTGTTCAGGC GATGCTGGAT GCGGTAATTG ATTACCTGCC ATCCCCGGTT 900 

GACGTACCTG CGATCAACGG TATCCTGGAC GACGGTAAAG ACACTCCGGC 950 

TGAACGTCAC GCAAGTGATG ACGAGCCGTT CTCTGCACTG GCGTTCAAAA 1000 

TCGCTACCGA CCCGTTTGTT GGTAACCTGA CCTTCTTCCG TGTTTACTCC 1050 

GGTGTGGTTA ACTCTGGTGA TACCGTACTG AACTCCGTGA AAGCTGCACG 1100 

TGAGCGTTTC GGTCGTATCG TTCAGATGCA CGCTAACAAA CGTGAAGAGA 1150 

TCAAAGAAGT TCGCGCGGGC GACATCGCTG CTGCTATCGG TCTGAAAGAC 1200 

GTAACCACTG GTGACACCCT GTGTGACCCG GATGCGCCGA TCATTCTGGA 1250 

ACGTATGGAA TTCCCTGAGC CGGTAATCTC CATCGCAGTT GAACCGAAAA 1300 

CCAAAGCTGA CCAGGAAAAA ATGGGTCTGG CTCTGGGCCG TCTGGCTAAA 1350 

GAAGACCCGT CTTTCCGTGT ATGGACTGAC GAAGAATCTA ACCAGACCAT 14 00 

CATCGCGGGT ATGGGCGAAC TGCACCTCGA CATCATCGTT GACCGTATGA 1450 

AGCGTGAATT CAACGTTGAA GCGAACGTAG GTAAACCGCA GGTTGCTTAC 1500 

CGTGAAACTA TCCGCCAGAA AGTTACCGAT GTTGAAGGTA AACACGCGAA 1550 

ACAGTCTGGT GGTCGTGGTC AGTATGGTCA TGTTGTTATC GACATGTACC 1600 

CGCTGGAGCC GGGTTCAAAC CCGAAAGGCT ACGAGTTCAT CAACGACATT 1650 

AAAGGTGGTG TAATCCCTGG CGAATACATC CCGGCCGTTG ATAAAGGTAT 1700 

CCAGGAACAG CTGAAAGCAG GTCCGCTGGC AGGCTACCCG GTAGTAGACA 1750 

TGGGTATTCG TCTGCACTTC GGTTCTTACC ATGACGTTGA CTCCTCTGAA 18 00 

CTGGCGTTTA AACTGGCTGC TTCTATCGCC TTTAAAGAAG GCTTTAAGAA 1850 

AGCGAAACCA GTTCTGCTTG AGCCGATCAT GAAGGTTGAA GTAGAAACTC 1900 

CGGAAGAGAA CACCGGTGAC GTTATCGGTG ACTTGAGCCG TCGTCGTGGT 1950 

ATGCTCAAAG GTCAGGAATC TGAAGTTACT GGCGTTAAGA TCCACGCTGA 2000 
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AGTACCGCTG TCTGAAATGT TCGGATACGC AACTCAGCTG CGTTCTCTGA 2 0 50 
CCAAAGGTCG TGCATCATAC ACTATGGAAT TCCTGAAGTA TGATGAAGCG 2100 
CCGAGTAACG TTGCTCAGGC CGTAATTGAA GCCCGTGGTA AATAA 214 5 



2) INFORMATION FOR SEQ ID NO: 1231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1232 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 42 bases 
(E) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 



2) INFORMATION FOR SEQ ID NO: 1233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1233 
GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 



2) INFORMATION FOR SEQ ID NO: 1234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234 
ACTAAATAAA CGCTCATTCG 



2 ) INFORMATION FOR SEQ ID NO: 1235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1235 
GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 



2 ) INFORMATION FOR SEQ ID NO: 1236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 6 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 



2) INFORMATION FOR SEQ ID NO: 1237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1238 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCGGCG TTAATTTTGG CACCGAAGAA GAGCTCGC 38 



15 2 ) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1239 

GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 33 



2 ) INFORMATION FOR SEQ ID NO: 1240 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 0 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 38 

2 ) INFORMATION FOR SEQ ID NO: 1241 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 35 

60 
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2) INFORMATION FOR SEQ ID NO: 1242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
15 (E) STRAIN: BM4147-1 

(F) ACCESSION NUMBER: U39790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 

20 TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG 50 

TGCATGTGCC ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG 10 0 

GTGTGCCGCA AGTTCCTTAT GTACCAGTAC TTAAGAATCA ATGGAAAGAA 150 

AATCCTAAAA AAGTATTTGA TCAATGTGAA GGTTCTTTGC TTTATCCGAT 20 0 

GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT ACAAAGGCAG 250 

2 5 AAAACCGAGA AGAGCTGCAA AATGCTTTAG CAACAGCCTA TCAGTATGAT 300 

TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC 350 

TGTATTAGGA AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA 4 00 

AAGACGTAGC ATTCTATGAT TATGAAGCCA AATATATCAA TAATAAAATC 450 

GAAATGCAGA TTCCAGCCGA AGTGCCGGAA GAAGTTTATC AAAAAGCGCA 500 

3 0 AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC GGATTGAGCC 550 

GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 600 



35 2 ) INFORMATION FOR SEQ ID NO: 1243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2275 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(C) ACCESSION NUMBER: M3 8386 



50 



( xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 



GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 200 

55 ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 3 00 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 350 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 400 

TTTCAATTGA ATTAAATCAA CAGTTTAATA CAAAAGAAGA AGCTATTCGC 450 

60 TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 500 
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CGAAGCAATG ATTGAAAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 550 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCAAAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 65 0 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

5 AAGAACATTT GCAATTAGTC CAACAAATTG CACTTTATTG TAGTGATATG 750 

GATAACGTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 8 00 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 95 0 

10 TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 10 0 0 

ACAAATTAAC GTTGAAAATG TGACCGGGTT AAATAACATG ACAGAACCAG 105 0 

AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGCTGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 1200 

15 GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 125 0 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 13 0 0 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 135 0 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 14 0 0 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 145 0 

20 TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 160 0 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 170 0 

25 CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 175 0 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 18 0 0 

TAGCAATGAT TGGTATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 185 0 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 190 0 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AAACGGTTAA AAATATTAAA 195 0 

30 CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2 00 0 

CCTTCTACCA AGATACTTCA CATTTCTTAA TTAAAGAAAA AACAACCGCG 2 05 0 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 210 0 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 2150 

TTCATGACTA CTTATATGGT CAAGG AGGT C TAAGGTTTCT CGCAAAGAAG 22 00 

35 TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 2250 

GAACTTGGTG AAGAAGGGGG GATCC 227 5 



40 2) INFORMATION FOR SEQ ID NO: 1244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 25923 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 



GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGCTCA 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGAAGCTGG TCATACGAGA GTTATATTTT 2 00 

60 CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 250 
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ATTATCACAC AAATATTTTA GCTTTTCAGT 
CTTTATCCAT ACGTTGAATA ATTGTACGAT 
ACACCTTTAA TGTTATTTGT TTTAAAAGCA 
ACGATGTGAA TCTTCTAAGA AGTCACCGTA 



PCT/CA00/01150 

TTCTATATCA ACTGTAGCTT 3 00 

TCTGACGCAC CATCTTTTGC 350 

TGAATAAGTT TTTCAAGACA 4 00 

AAATGAAGGA TC 442 



669 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

20 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 2 00 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

25 TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 4 50 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

3 0 GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 

ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTC 845 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1246 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 1198 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246 

TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA ATGTCTTCCA 50 
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CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG TAAAGCAATC 100 

TCTGTACCAG TTGGTGATGT AACACTTGGT CGTGTATTTA ACGTATTAGG 150 

TGATGCAATT GACTTAGATG GTGATGTTCC TGCGGATGTA CGTCGTGATC 200 

CAATTCACCG TCAAGCGCCT GCATTCGAAG AGTTATCTAC TAAAGTAGAA 250 

5 ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC CTTACATTAA 300 

GGGTGGTAAG ATTGGTCTAT TCGGTGGTGC CGGCGTAGGT AAAACAGTAT 350 

TAATTCAGGA ATTAATTAAT AACATCGCAC AAGAGCACGG TGGTATCTCT 400 

GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAAGGTAACG ACTTATACCA 450 

CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG GTATTCGGAC 500 

10 AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCATT AACAGGTTTA 550 

ACAATGGCTG AACATTTCCG TGATGAGCAA GGACAAGACG TACTATTGTT 600 

CATCGATAAC ATCTTCCGTT TCACGCAAGC GGGTTCTGAA GTATCTGCCC 650 

TTCTTG 656 
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2) INFORMATION FOR SEQ ID NO: 1247 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 107 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247 



CGAAAACGGA AGTATTAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT C C AC AG AT GG ACTTGTTCGT 100 

35 GGCACAGAAG TAGAAGATAC TGGTAAACCA ATCTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGTCGCGTAT TTAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAGGT TCCTGCAGAT GTACATCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

40 TATTCGGTGG TGCCGGCGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 450 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACT GCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

45 CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA AC AG AAAT GG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT A 7 91 



2) INFORMATION FOR SEQ ID NO: 1248 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AC1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248 

ATCTACAATG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGCATGAA 50 

CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 

TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

ACTGGTAAAG CAATCTCTGT ACCAGTTGGT GATGCAACAC TTGGACGTGT 200 

ATTCAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAA CTTCCTGCGG 250 

ATGTACACCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 300 

TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 350 

TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 400 

TAGGTAAAAC AGTATTAATT CAGGAGTTAA TCAATAACAT CGCACAAGAG 450 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 500 

TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 550 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 600 

GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA 650 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 700 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 7 50 

CCAACACTTG CAACAGAAAT GGGTCAATTA CAAGAGCGTA TTACATCTAC 800 

AAATAAAGGG TCTATCACGT CTATC 825 



2) INFORMATION FOR SEQ ID NO: 124 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGTATTAA 50 
CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 
TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 
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ACTGGTAAAC 
ATTTAACGTA 
ATGTACATCG 
TCTACTAAAG 
TGCTCCTTAC 
TAGGTAAAAC 
CACGGTGGTA 
TAATGACTTA 
CGATGGTATT 
GCATTAACAG 
AGACGTACTT 
CTGAAGTATC 
CCAACACTTG 



CAATCTCTGT 
TTAGGTGATG 
TGATCCAATT 
TAGAAATTCT 
ATTAAGGGTG 
AGTATTAATT 
TCTCTGTATT 
TACCACGAAA 
CGGACAAATG 
GTTTAACAAT 
CTGTTCATCG 
TGCCCTTCTT 
CAACAGAAAT 



ACCAGTTGGT 
CAATTGACTT 
CACCGTCAAG 
TGAAACTGGT 
GTAAGATCGG 
CAGGAATTAA 
CGCTGGTGTA 
TGAGCGATTC 
AACGAGCCAC 
GGCTGAGCAT 
ATAACATCTT 
GGTCGTATGC 
GGGTC 



GATGTAACAC TTGGTCGCGT 
AGATGGTGAG GTTCCTGCAG 
CACCTGCATT CGAAGAATTA 
ATTAAAGTAG TAGACTTACT 
CCTATTCGGT GGTGCCGGCG 
TTAACAACAT CGCACAAGAG 
GGTGAGCGTA CTCGTGAGGG 
TGGCGTAATC AAGAAAACTG 
CTGGAGCACG TCAACGTGTT 
TTCCGTGATG AGCAAGGACA 
CCGTTTCACG CAAGCGGGTT 
CATCTGCGGT AGGTTACCAA 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
775 



2) INFORMATION FOR SEQ ID NO: 1250 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250 

CCAGCAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAGC 50 

GAGCATCAAC TTAACATTTG AAGTTGCACT TCATTTAGGT GATGACACAG 100 

TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG TGGCACAGAA 150 

GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG ATGTAACACT 200 

TGGTCGCGTA TTCAACGTAT TAGGTGATGC AATTGACTTA GATGGTGATG 250 

TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC ACCTGCATTC 300 

GAAGAACTAT CTACAAGAAT AGAAATTCTT GAAACTGGTA TTAAAGTAGT 350 

AGATTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGT CTATTCGGTG 4 00 

GTGCCGGTGT AGGTAAAACG GTATTAATTC AGGAATTAAT TAACAACATC 4 50 

GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG GTGAGCGTAC 500 

TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT GGCGTAATTA 550 

AGAAAACTGC GATGGTATTT GGACAAATGA ACGAGCCACC TGGAGCACGT 600 

CAACGTGTTG CATTAACAGG TTTAACAATG GCTGAACATT TCCGTGATGA 650 

GCAAGGACAA GACGTACTAT TGTTCATCGA TAACATCTTC CGTTTCACGC 700 

AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GTCGTATGCC ATCTGCGGTA 750 

GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC AAGAGCGTAT 800 

TACATCTACA AATAAAGGGT CTATCACGTC TA 832 



2) INFORMATION FOR SEQ ID NO: 1251 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 

AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 

AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA GAAGTAGAAG 150 

ATACTGGTAA ACCAATCTCT GTACCAGTTG GTGATGTAAC ACTTGGTCGC 200 

GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG AGGTTCCTGC 250 

AGATGTACAT CGTGATCCAA TTCACCGTCA AGCACCTGCA TTCGAAGAAT 300 

TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT AGTAGACTTA 350 

CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG GTGGTGCCGG 400 

CGTAGGTAAA ACAGTATTAA TTCAGGAATT AATTAACAAC ATCGCACAAG 450 

AGCACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG TACTCGTGAG 500 

GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA TCAAGAAAAC 550 

TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA CGTCAACGTG 600 

TTGCATTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA TGAGCAAGGA 650 

CAAGACGTAC TTCTGTTCAT CGATAACATC TTCCGTTTCA CGCAAGCGGG 700 

TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT GCCATCTGCG GTAGGTTACC 750 

AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG TATTACATCT 800 

AC 802 



2) INFORMATION FOR SEQ ID NO: 1252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1410 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 
AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 
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AGTTGCAATG TCTTCCACAG ATGGACTTGT 
ATACTGGTAA ACCAATCTCT GTACCAGTTG 
GTATTTAACG TATTAGGTGA TGCAATTGAC 
AGATGTACAT CGTGATCCAA TTCACCGTCA 
5 TATCTACTAA AGTAGAAATT CTTGAAACTG 
CTTGCTCCTT ACATTAAGGG TGGTAAGATC 
CGTAGGTAAA ACAGTATTAA TTCAGGAATT 
AGCACGGTGG TATCTCTGTA TTCGCTGGTG 
GGTAATGACT TATACCACGA AATGAGCGAT 

10 TGCGATGGTA TTCGGACAAA TGAACGAGCC 
TTGCATTAAC AGGCTTAACA ATGGCTGAGC 
CAAGACGTAC TTCTGTTCAT CGATAACATC 
TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT 
AACCAACACT TGCAACAGAA ATGGGTCAAT 

15 ACAAATAAAG GGTCTATCAC GTC 



PCT/CA00/01150 

TCGTGGCACA GAAGTAGAAG 150 

GTGATGTAAC ACTTGGTCGC 200 

TTAGATGGTG AGGTTCCTGC 250 

AGCACCTGCA TTCGAAGAAT 300 

GTATTAAAGT AGTAGACTTA 350 

GGCCTATTCG GTGGTGCCGG 4 00 

AATTAACAAC ATCGCACAAG 450 

TAGGTGAGCG TACTCGTGAG 500 

TCTGGCGTAA TCAAGAAAAC 550 

ACCTGGAGCA CGTCAACGTG 600 

ATTTCCGTGA TGAGCAAGGA 650 

TTCCGTTTCA CGCAAGCGGG 7 00 

GCCATCTGCG GTAGGTTACC 7 50 

TACAAGAGCG TATTACATCT 8 00 

823 



2) INFORMATION FOR SEQ ID NO: 1253 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1253 

35 CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

40 CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATCAA TAACATCGCA 4 50 

CAAGAGCATG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

45 TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

50 TACCAACCGA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATT 7 98 
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2) INFORMATION FOR SEQ ID NO: 1254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254 

CATTTGAAGT TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA 50 

ATGTCTTCCA CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG 100 

TAAAGCAATC TCTGTACCAG TTGGTGATGC AACACTTGGT CGTGTATTTA 150 

20 ACGTATTAGG TGATGCAATT GACTTAGATG GTGAGGTTCC TGCGGATGTA 200 

CGTCGTGATC CAATTCACCG TCAAGCACCT GCATTCGAAG AATTATCTAC 2 50 

TAAAGTAGAA ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC 300 

CTTACATTAA GGGTGGTAAG ATCGGTCTAT TCGGTGGTGC CGGTGTAGGT 350 

AAAACGGTAT TAATTCAGGA ATTAATCAAT AACATCGCAC AAGAACACGG 400 

25 TGGTATCTCT GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAGGGTAATG 4 50 

ACTTATACCA CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG 500 

GTATTCGGAC AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCGTT 550 

AACAGGTTTA ACAATGGCTG AGCATTTCCG TGATGAGCAA GGACAAGACG 600 

TACTTCTGTT CATCGATAAT ATCTTCCGTT TCACGCAAGC AGGTTCTGAA 650 

30 GTATCTGCCC TTCTTGGCCG TATGCCATCT GCGGTAGGTT ACCAACCAAC 7 00 

ACTTGCAACA GAAATGGGTC AATTACAAGA GCGTATTACA TCTACAAATA 7 50 

AAGGGTCTAT CACGTCT 7 67 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1255 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1174 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: C-14 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255 

GAAATGCGTG AATCATTTTT AGATTATGCG ATGAGTGTTA TCGTTGCTCG 50 
TGCATTGCCA GATGTTCGTG ACGGTTTAAA ACCAGTACAT CGTCGTATAC 100 
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TATATGGATT AAATGAACAA GGTATGACAC CGGATAAATC ATATAAAAAA 150 

TCAGCACGTA TCGTTGGTGA CGTAATGGGT AAATATCACC CTCATGGTGA 200 

CTTATCTATT TATGAAGCAA TGGTACGTAT GGCTCAAGAT TTCAGTTATC 250 

GTTATCCGCT TGTTGATGGC CAAGGTAACT TTGGTTCAAT GGATGGAGAT 300 

5 GGCGCAGCAG CAATGCGTTA TACTGAAGCG CGTATGACTA AAATCACACT 350 

TGAACTGTTA CGTGATATTA ATAAAGATAC AATAGATTTT ATCGATAACT 4 00 

ATGATGGTAA TGAAAGAGAG CCGTCAGTCT TACCTGCTCG ATTCCCTAAC 4 50 

TTATTAGCCA ATGGTGCATC AGGTATCGCG GTAGGTATGG CAACGAATAT 500 

TCCACCACAT AACTTAACAG AATTAATCAA TGGTGTACTT AGCTTAAGTA 550 

10 AGAACCCTGA TATTTCAATT GCTGAGTTAA TGGAGGATAT TGAAGGTCCT 600 

GATTTCCCAA CTGCTGGACT TATTTTAGGT AAGAGTGGTA TTAGACGTGC 650 

ATATGAAACA GGTCGTGGTT CAATTCAAAT GCGTTCTCGT GCAGTTATTG 700 

AAGAACGTGG AGNCGGACGT CAACGTATTG TTGTCACTGA AATTCCTTTC 750 

CAAGTGAATA AGGCTCGTAT GATTGAAAAA ATTGCAGAGC TCGTTCGTGA 8 00 

15 CAAGAAAATT GACGGTATCA CTGATTTACG TGATGAAACA AGTTTACGTA 850 

CTGGTGTGCG TGTCGTTATT GATGTGCGTA AGGATGCAAA TGCTAGTGTC 900 

ATTTTAAATA ACTTATACAA ACAAACACCT CTTCAAACAT CATTTGGTGT 950 

GAATATGATT GCACTTGTAA ATGGTAGACC GAAGCTTATT AATTTAAAAG 1000 

AAGCGTTGGT ACATTATTTA GAGCATCAAA AGACAGTTGT TAGAAGACGT 1050 

20 ACGCAATACA ACTTACGTAA AGCTAAAGAT CGTGCCCACA TTTTAGAAGG 1100 

ATTACGTATC GCACTTGACC ATATCGATGA AATTATTTCA ACGATTCGTG 1150 

AGTCAGATAC AGATAAAGTT GCAA 1174 



25 

2) INFORMATION FOR SEQ ID NO: 1256 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10209 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

45 GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGTAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGA GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCGCCTGC ATTCGAAGAG 300 

TTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 400 

50 GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCACG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT AT TCGGACAA ATGAACGAGC CACCTGGAGC ACGTCAACGT 600 
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GTTGCATTAA CAGGTTTAAC 
GCAAGACGTA CTATTGTTCA 
GTTCTGAAGT ATCTGCCCTT 
CAGCCAACAC TTGCAACAGA 



AATGGCTGAA CATTTCCGTG 
TCGATAACAT CTTCCGTTTC 
CTTGGTCGTA TGCCATCTGC 
AATGGGTCAA 



ATGAGCAAGG 650 
ACGCAAGCGG 700 
GGTAGGTTAC 750 

780 



2) INFORMATION FOR SEQ ID NO: 1257 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 
20 (B) STRAIN: CIP 7700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257 

CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA ACGAAAACGG 50 

25 AACAAGCATT AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGACA 100 

CAGTTCGTAC AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG AT ACT GGT AA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

ACTTGGTCGT GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AGGTTCCTGC GGATGTACGT CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

3 0 TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGTCTATTCG 4 00 

GTGGTGCCGG TGTAGGTAAA ACGGTATTAA TTCAGGAATT AATCAATAAC 4 50 

ATCGCACAAG AACACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 500 

, - TACTCGTGAG GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA 550 

35 TTAAGAAAAC TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA 600 

CGTCAACGTG - TTGCGTTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 650 

TGAGCAAGGA CAAGATGTAC TTCTGTTCAT CGATAATATC TTCCGTTTCA 7 00 

CGCAAGCAGG TTCTGAAGTA TCTGCCCTTC TTGGCCGTAT GCCATCTGCG 7 50 

GTAGGTTACC AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG 800 

40 TATTACATCT ACAAATA 817 



45 



2) INFORMATION FOR SEQ ID NO: 1258 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

.(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1404 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258 

CCAGAAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

TATTAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

10 GAAGATACTG GTAAACCAAT CTCTGTACCA GTTGGTGATG TAACACTTGG 200 

TCGCGTATTT AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAGGTTC 250 

CTGCAGATGT ACATCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

15 CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATTAA CAACATCGCA 4 50 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC AC G AAAT GAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

20 AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 7 50 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 8 00 

ATCTACAAAT AAAGGGTCTA TCACGTCTA 829 



25 



2) INFORMATION FOR SEQ ID NO: 1259 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
40 (C) ACCESSION NUMBER: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259 

TGGCGGAAAG CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA 50 

45 ACGAAAACGG TGAACTTAAC TTAACATTTG AAGTTGCACT TCATTTAGGT 100 

GAT G AT ACAG TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG 150 

TGGCACAGAA GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG 200 

ATGCAACACT TGGTCGCGTA TTTAACGTAT TAGGTGATGC TATTGACTTA 250 

GATGGTGAGG TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC 300 

50 ACCTGCATTC GAAGAATTAT CTACTAAAGT AGAAATTCTT GAAACTGGTA 350 

TTAAAGTAGT AGACTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGC 4 00 

CTATTCGGTG GTGCCGGTGT AGGTAAAACA GTATTAATTC AGGAGTTAAT 4 50 

CAACAACATC GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG 500 
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20 



45 



GTGAGCGTAC TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT 550 

GGCGTAATTA AGAAAACTGC GATGGTATTC GGACAAATGA ACGAGCCACC 600 

TGGAGCACGT CAACGTGTTG CATTAACAGG CTTAACAATG GCTGAATATT 650 

TCCGTGATGA GCAAGGACAA GACGTACTTC TGTTCATCGA TAATATCTTC 700 

CGTTTCACGC AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GCCGTATGCC 7 50 

ATCTGCGGTA GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC 800 

AAGAGCGTAT TACATCTACA AATAAAGGGT CTATCACGTC TATC 844 



2) INFORMATION FOR SEQ ID NO: 1260 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN : ATCC 49064 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 60 

AAGCTACCAG AAATCTACAA CGCCCTTACG GTAAAACAGA GCAACGAAAA 50 

CGGAACAAGC ATTAACTTAA CATTTGAAGT TGCACTTCAT TTAGGTGATG 100 

ACACAGTTCG TACAGTTGCA AT GT CT T CCA CAGATGGACT TGTTCGTGGC 150 

3 0 ACAGAAGTAG AAGATACTGG TAAAGCAATC TCTGTACCAG TTGGTGATGC 200 

AACACTTGGT CGTGTATTTA ACGTATTAGG TGATGCAATT GACTTAGATG 250 

GTGAGGTTCC TGCGGATGTA CGCCGTGATC CAATTCACCG TCAAGCACCT 300 

GCATTCGAAG AATTATCTAC TAAAGTAGAA ATTCTTGAAA CTGGTATTAA 350 

AGTAGTAGAC TTACTTGCTC CTTACATTAA GGGTGGTAAG ATCGGTCTAT 400 

35 TCGGTGGTGC CGGTGTAGGT AAAACAGTAT TAATTCAGGA ATTAATCAAC 450 

AACATCGCAC AAGAACACGG TGGTATCTCT GTATTCGCTG GTGTAGGTGA 500 

GCGTACTCGT GAGGGTAATG ACTTATACCA CGAAATGAGC GATTCAGGCG 550 

TAATTAAGAA AACTGCGATG GTATTCGGAC AAAT GAACGA GCCACCTGGA 600 

GCGCGTCAAC GTGTTGCGTT AACAGGTTTA ACAATGGCTG AGCATTTCCG 650 

40 TGATGAGCAA GGACAAGACG TTCTTCTGTT CATCGATAAT ATCTTCCGTT 700 

TCACGCAAGC AGGTTCTGAA GTATCTGCCC TTCTTGGTCG TATGCCATCT 750 

GCGGTAGGTT ACCAACCAAC ACTTGCAACA GAAATGGGTC AATTACAAGA 800 

GCGTATTACA TCTACAAATA AAGGGTCTAT CACGTCTATC 84 0 



2) INFORMATION FOR SEQ ID NO: 1261 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AZ1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261 



10 GCGGAAAGCT 
GAAAACGGAA 
TGATACAGTT 
GCACAGAAGT 
GCAACACTTG 

15 TGGTGAACTT 
CTGCATTCGA 
AAAGTAGTAG 
ATTCGGTGGT 
ATAACATCGC 

20 GAGCGTACTC 
CGTAATCAAG 
GAGCACGTCA 
CGTGATGAGC 
TTTCACGCAA 

25 CTGCGGTAGG 
GAGCGTATTA 



ACCAGAAATC 
GCATGAACTT 
CGTACAGTTG 
AGAAGATACT 
GACGTGTATT 
CCTGCGGATG 
AGAATTATCT 
ACTTACTTGC 
GCCGGCGTAG 
ACAAGAGCAC 
GTGAGGGTAA 
AAAACTGCGA 
ACGTGTTGCA 
AAGGACAAGA 
GCGGGTTCTG 
TTACCAACCA 
CATCTACAAA 



TACAATGCCC 
AACATTTGAA 
CGATGTCTTC 
GGTAAAGCAA 
CAACGTATTA 
TACACCGTGA 
ACTAAAGTAG 
TCCTTACATT 
GTAAAACAGT 
GGTGGTATCT 
TGACTTATAC 
TGGTATTCGG 
TTAACAGGTT 
CGTACTTCTG 
AAGTATCTGC 
ACACTTGCAA 
TAAAGGGTCT 



TTACGGTAAA 
GTTGCACTTC 
CACAGATGGA 
TCTCTGTACC 
GGTGATGCAA 
TCCAATTCAC 
AAATTCTTGA 
AAGGGTGGTA 
ATTAATTCAG 
CTGTATTCGC 
CACGAAATGA 
ACAAATGAAC 
TAACAAT GGC 
TTCATCGATA 
CCTTCTTGGT 
CAGAAAT GGG 
ATCACGTCT 



ACAAAGCAAC 50 

ATTTAGGTGA 100 

CTTGTTCGTG 150 

AGTTGGTGAT 200 

TTGACTTAGA 250 

CGTCAAGCAC 300 

AACTGGTATT 350 

AGATCGGCCT 4 00 

GAGTTAATCA 450 

TGGTGTAGGT 500 

GCGATTCTGG 550 

GAGCCACCTG 600 

TGAGCATTTC 650 

ACATCTTCCG 700 

CGTATGCCAT 750 

TCAATTACAA 800 

839 



30 2) INFORMATION FOR SEQ ID NO: 1262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4H2 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1262 



AAGCTACCAG AAATCTACAA TGCCCTTACG GTAAAACAAA GCAACGAAAA 50 

CGGAAGCATG AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA 100 

CAGTTCGTAC AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

50 ACTTGGACGT GTATTCAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AACTTCCTGC GGATGTACAC CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG 400 
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GTGGTGCCGG CGTAGGTAAA 
ATCGCACAAG AGCACGGTGG 
TACTCGTGAG GGTAATGACT 
TCAAGAAAAC TGCGATGGTA 
5 CGTCAACGTG TTGCATTAAC 
TGAGCAAGGA CAAGACGTAC 
CGCAAGCGGG TTCTGAAGTA 
GTAGGTTACC AACCAACACT 
TATTACATCT ACAAATAAAG 



ACAGTATTAA TTCAGGAATT 
TATCTCTGTA TTCGCTGGTG 
TATACCACGA AATGAGCGAT 
TTCGGACAAA TGAACGAGCC 
AGGTTTAACA ATGGCTGAGC 
TTCTGTTCAT CGATAACATC 
TCTGCCCTTC TTGGTCGTAT 
TGCAACAGAA ATGGGTCAAT 
GGTCTATCAC GTC 



PCT/CA00/01150 

AATCAATAAC 450 

TAGGTGAGCG 500 

TCTGGCGTAA 550 

ACCTGGAGCA 600 

ATTTCCGTGA 650 

TTCCGTTTCA 700 

GCCATCTGCG 750 

TACAAGAGCG 800 

833 



2) INFORMATION FOR SEQ ID NO: 1263 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
25 (B) STRAIN: BGSC 4Q1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263 

CGAAAACGGA AGCATGAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

30 ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAAGCA ATTTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGACGTGTAT TCAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAACT TCCTGCGGAT GTACACCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

35 TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGTGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 4 50 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

40 GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT 7 90 

45 



2) INFORMATION FOR SEQ ID NO: 1264 

50 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264 

10 



15 



20 



GCGATGTCTT 


. 50 


TGGTAAACCA 


100 


TTAACGTATT 


150 


GTACATCGTG 


200 


TACTAAAGTA 


250 


CTCCTTACAT 


300 


GGTAAAACAG 


350 


CGGTGGTATC 


400 


ATGACTTATA 


450 


ATGGTATTCG 


500 


ATTAACAGGT 


550 


ACGTACTTCT 


600 


GAAG 


644 



25 



2 ) INFORMATION FOR SEQ ID NO: 12 65 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265 



GGCGGAAAGC TACCAGAAAT CTACAACGCC 
CGAAAACGGA ACAAGCATTA ACTTAACATT 

45 GTGATGACAC AGTTCGTACA GTTGCAATGT 
CGTGGCACAG AAGTAGAAGA TACTGGTAAA 
TGATGCAACA CTTGGTCGTG TATTTAACGT 
TAGATGGTGA GGTTCCTGCG GATGTACGTC 
GCACCTGCAT TCGAAGAATT ATCTACTAAA 

50 TATTAAAGTA GTAGACTTAC TTGCTCCTTA 
GTCTATTCGG TGGTGCCGGT GTAGGTAAAA 
ATCAATAACA TCGCACAAGA ACACGGTGGT 
AGGTGAGCGT ACTCGTGAGG GTAATGACTT 



CTTACGGTAA AACAGAGCAA 50 

TGAAGTTGCA CTTCATTTAG 100 

CTTCCACAGA TGGACTTGTT 150 

GCAATCTCTG TACCAGTTGG 200 

ATTAGGTGAT GCAATTGACT 250 

GTGATCCAAT TCACCGTCAA 300 

GTAGAAATTC TTGAAACTGG 350 

CATTAAGGGT GGTAAGATCG 400 

CGGTATTAAT TCAGGAATTA 4 50 

ATCTCTGTAT TCGCTGGTGT 500 

ATACCACGAA ATGAGCGATT 550 
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CTGGCGTAAT TAAGAAAACT GCGATGGTAT TCGGACAAAT GAACGAGCCA 
CCTGGAGCAC GTCAACGTGT TGCGTTAACA GGTTTAACAA TGGCTGAGCA 
TTTCCGTGAT GAGCAAGGAC AAGATGTACT TCTGTTCATC GATAATATCT 
TCCGTTTCAC GCAAGCAGGT TCTGAAGTAT CTGCCCTTCT TGGCCGTATG 
CCATCTGCGG TAGGTTACCA ACCAACACTT GCAACAGAAA TGGGTCAATT 
ACAAGAGCGT ATTACATCTA CAA 



600 
650 
700 
750 
800 
823 



2) INFORMATION FOR SEQ ID NO: 1266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266 

TGGTCCGAGR CCCGATTCMA TGAAATTATC AAGGAAACCT CCAAYTTCAT 50 

TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT CCCATTTCTG 100 

GTTTCCAGGG TGACAACATG ATCGATSCCT CTGCCAACTG CCCATGGTAC 150 

AAGGGCTGGT ACMAKGAGAC TGCCGACAGG CAAGYACTCT GGCAAGACCC 200 

TTCTTGAGGC CATTGACGSC ATTGAGCCCC CCAMSCGTCC TWCCGATAAA 250 

CCTCTCCGTC TTCCTCTCCA GG AT GTCT AC AAGATCTCCG GTATTGGMAC 300 

TGTTCCTGTC GGACGTRTTG AGACTGGAGT CATCAAGCCC GGTATGGTCG 350 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAAATG 400 

CACCACCAGC AGCTTTCCGA CGGTAWCCCC GGTGACAACG TCGGCTTCAA 4 50 

CGTCAAGAAT GTTTCCGTCA AAGAAGTCCG CCGTGGTAAC GTTGCCTGGT 500 

GACTCTAAGA ATGATCCCGC MAWGGGCTGC GATTCCTTCA ATGCYCAGGT 550 

CATCGTCCTC AACCACCCTG GTCAGGTTGG CGCTGGTTAT GCCCCAGTCC 600 

TCGAYTGCCA TACTGCCCAC ATTGCYTGCA ARTTCGCTGA GMTCMAKGAG 650 

AAGATTGAYC GCCGAACCGG MAAGTCTGTT GAGAACGCCC CCAAGTTCAT 700 

CAAGTCCGGT GATGC 715 



2) INFORMATION FOR SEQ ID NO: 1267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 56220 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267 

GAGTCCTCTT ATTTACTTTT GTCATGACTA CCTTACTAAT CTGTCATAGA 50 

TCGTTACAAC GAAAT CGTCA AGGAGACTTC CAACTTCATC AAGAAGGTCG 100 

GATACAACCC CAAGAACGTT CCTTTCGTTC CTATCTCCGG TTTCAACGGC 150 

10 GACAACATGC TTGAGCCCTC CCCCAACTGC CCCTGGTACA AGGGTTGGGA 200 

GAAGGAGACC AAGGCCGGTA AGGTCACTGG TAAGACCCTC CTCGAGGCCA 250 

TCGACGCCAT TGAGCCCCCT ACCCGTCCCG CCAACAAGGT CAGTACTACC 300 

TCAATTACTT GAACTCTCTT CATACGTTCC GATTACTGAC TGCTTCACAG 350 

CCCCTCCGTC TTCCCCTCCA GGACGTTTAC AAGATCGGTG GTATTGGAAC 400 

15 GGTGCCCGTC GGTCGTGTTG AGACCGGTAC CATCTCCCCT GGTATGGTCG 4 50 

TTACCTTGTA TGTATCCTGA CCATCCCCCT TGGCAATCAT TACGTACTAA 500 

CTCACTCTTC AGCGCTCCCG CCAACGTCAC CACTGAAGTC AAGAGTGTTG 550 

AAATGCACCA CCAGCAGCTC GCTGCCGGTC AGCCCGGTGA CAACGTTGGT 600 

TTCAACGTGA AGAACGTCTC CGTCAAGGAA ATCCGTCGTG GTAACGTTGC 650 

20 TGGTGATAGC AAGAACGACC CCCCTGCCGG TGCTGCTTCC TTCAACGCCC 7 00 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGTGCTGG TTACGCCCCA 7 50 

GTCCTTGACT GCCACACTGC CCACATTGCT TGCAAGTTCT CTGAACTCCT 800 

TGAGAAGATT GACCGTCGTA CCGGAAAGTC TGTTGAGGAC CACCCCAAGT 850 

TCATCAAGTC CGGTGACGCT GCCAT 875 

25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268 



GTGAGCGTGG 
45 AAGTACAGTG 
ATTTCAACTT 
GCCATCGTGA 
• TGCGCTATCC 
CTCCAAGGAT 
50 GTGTGAGGCA 
TCCGAGTCCC 
GAAGGTCGGA 
TCGAGGGTGA 



TATCACCATC 
TCACTGTCAT 
CCAGAGTATC 
CTTCATCAAG 
TCATCATTGC 
GGCCAGACTC 
ACTCATCGTT 
GTTTCAACGA 
TATAACCCCA 
CAACATGATT 



GATATTGCCC 
TGGTGAGTGC 
TACTCTAACA 
AACATGATCA 
TGCCGGTACT 
GTGAGCACGC 
GCCATCAACA 
AAT CATCAAG 
AGGCTGTTCC 
GAACCCTCCC 



TCTGGAAATT 
TTTTTACCCC 
TATCCGCTTA 
CTGGTACCTC 
GGTGAGTTCG 
TCTGCTTGCT 
AGATGGACAC 
GAGGTTTCCA 
CTTCGTGCCA 
CCAACTGCAC 



CGAGACCCCG 50 

TCTTAAGCAG 100 

GATGCTCCCG 150 

CCAGGCTGAC 200 

AGGCTGGTAT 250 

TTCACCCTTG 300 

CACCAAGTGG 350 

ACTTCATCAA 400 

ATCTCTGGTT 4 50 

CTGGTACAAG 500 
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GGCTGGAACA AGGAGACTGC CTCTGGCAAG TCTTCTGGTA AAACCCTTCT 550 

CGATGCCATT GACGCCATTG AACCCCCAAC CCGTCCTACC GATAAGCCTC 600' 

TCCGTCTTCC CCTCCAGGAT GTTTACAAAA TCTCTGGTAT TGGCACTGTT 650 

CCCGTCGGAC GTGTTGAGAC TGGTGTCATC AAGCCCGGTA TGGTCGTGAC 700 

5 TTTCGCTCCC TCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACCAACAACT CCAGGCTGGT TACCCTGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTTT CAGTCAAGGA AGTCCGCCGT GGCAACGTTG CTGGCGACTC 850 

CAAAAATGAT CCCCCCAAGG GCTGCGAATC CTTCAATGCC CAGGTCATCG 900 

TCCTTAACCA CCCCGGCCAG GTTGGCGCTG GTTATGCCCC AGTCCTCGAC 950 

10 TGCCACACTG CCCACATTGC TTGCAAGTTC TCTGAACTCA TTGAGAAGAT 1000 

CGACCGCCGT ACTGGAAAGT CTGTTGAGAA CAACCCCAAG TTCATCAAGT 1050 

CTGGTGATGC TGCTATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTG 1100 

GAGCCCTTCA CTGACTATCC CCCT 1124 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1269 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1043 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton rubrum 

(B) STRAIN: WSA-224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269 



GTGAGCGTGG TATCACCATC GATATCGCCC TCTGGAAGTT CGAGACCCCC 50 

AAGTACAATG TCACCGTCAT TGGTATGTTT CTTTGCCTTG TTCCCTCATG 100 

35 TGGTTGTACC ATATCTAACG AGAGTAGACG CCCCCGGTCA CCGTGACTTC 150 

ATCAAGAACA TGATCACTGG TACCTCCCAG GCTGACTGCG CTATTCTCAT 200 

CATTGCTGCC GGTACTGGTG AGTTCGAGGC TGGTATCTCC AAGGATGGCC 250 

AGACCCGTGA GCACGCTCTG CTCGCCTTCA CCCTCGGTGT CAAGCAGCTC 300 

ATCGTTGCCA TCAACAAGAT GGACACCACC GGCTGGTCCG AGGATCGTTT 350 

4 0 CAAGGAAATT ATCAAGGAAG TCACCAACTT CATCAAGAAG GTTGGCTACG 4 00 

ACCCCAAGGG TGTTCCATTC GTTCCAATCT CTGGTTTCAA CGGTGACAAC 450 

ATGATTGAGG CCTCCACCAA CTGCCCATGG TACAAGGGAT GGAACAAGGA 500 

GACCAAGGCC GGTGGTGCCA AGTCCGGCAA GACCCTCCTC GAGGCCATCG 550 

ATGCCATCGA CATGCCAACC CGTCCTACCG ACAAGCCCCT CCGTCTCCCA 600 

45 CTCCAGGATG TCTACAAGAT CTCTGGTATC GGAACTGTGC CAGTCGGTCG 650 

TGTTGAGACC GGTATCATCA AGCCCGGTAT GGTCGTCACC TTNGCCCCCG 7 00 

CCAACGTCAC CACTGAAGTC AAGTCCGTYK AAATGCACCA CCAGCAGCTT 750 

CAGCAGGGTG TCCCCGGTGA CAACGTCGGC TTCAATGTCA AGAACGTTTC 800 

CGTCAAGGAA GTCCGCCGTG GTAACGTTGC CGGTGACTCC AAGAACGACC 850 

50 CACCATCCGG CTGTGCCTCC TTCAACGCCC AGGTCATYGT CCTCAACCAC 900 

CCCGGCCAGA TCGGTGCTGG TTACGSTCCA GTCCTCGACT GCCACACTGS 950 

TCACATTGCT TGCAAGTTCG CTGAGCTCCT CGAGAAGATT GACCGCCGTA 1000 

CCGGTAAATC CGTCGAAGCC AACCCCAAGT TCGTCAAGTC TGG 104 3 
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2) INFORMATION FOR SEQ ID NO: 1270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum canis 

(B) STRAIN: WSA-217 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270 



PCT/CAOO/01150 



10 



GCTGAGCGTG 
GACCCCCAAG 
TTTATAGTTG 

20 TGACTTCATC 
TTCTCATCAT 
GATGGCCAGA 
GCAGCTCATC 
CCCGTTTCGG 

25 GGCTACGACC 
TGACAACATG 
ACAAGGAGAC 
GCCATCGATG 
TCTCCCACTC 

30 TCGGTCGTGT 
GCCCCCGCCA 
GCAGCTYGTC 
ACGTYTCTGT 
AACGACCCAC 

35 CAACCACCCC 
ACACTGCCCA 
CGCCGTACTG 
TGATGCCGCT 
CTTTC 

40 



AGCGTGGTAT 
TACATGGTCA 
CGACCAGTAA 
AAGAACATGA 
TGCTGCCGGT 
CTCGTGAGCA 
GTTGCCATCA 
TGAAATCATC 
CCAAGGGTGT 
ATTGAGCCCT 
CAAGGCCGGT 
CCATTGACAT 
CAGGATGTCT 
TGAGACTGGT 
ACGTCACCAC 
CAGGGTGTTC 
CAAGGAAGTC 
CAGCTGGCTG 
GGCCAGATCG 
CATTGCTTGC 
GTAAATCCGT 
ATTGCCACCA 



CACCATTGAT 
CCGTCATCGG 
CTAACAAAAA 
TTACTGGTAC 
ACTGGTGAGT 
CGCCCTGCTC 
ACAAGATGGA 
AAGGAAGTCA 
CCCATTCGTC 
CCACCAACTG 
GGCAAATCCT 
GCCCACTCGT 
ACAAGATCTC 
ATCATCAAGC 
TGAAGTCAAG 
CCGGTGACAA 
CGCCGTGGTA 
CGCCTCTTTC 
GTGCTGGTTA 
AAGTTCTCTG 
CGAAACCAGC 
TGGTTCCATC 



ATCGCCCTCT 
TATGCTTTAT 
GTAGATGCCC 
CTCCCAGGCC 
TCGAGGCTGG 
GCTTTCACCC 
CACCACCAAC 
CCAACTTCAT 
CCAATCTCTG 
CCCATGGTAC 
CTGGTAAGAC 
CCCACCGACA 
TGGTATCGGA 
CTGGTATGGT 
TCCGTCGAAA 
CGTTGGCTTC 
ACGTTGCCGG 
AAGGCCCAGG 
CGCCCCAGTC 
AGCTTCTTGA 
CCTAAGTTCG 
CAAGCCCATG 



GGAAGTTCGA 50 

CTGTTTCCCA 100 

CCGGGCACCG 150 

GACTGCGCTA 200 

TATCTCCAAG 250 

TCGGTGTCAA 300 

TGGTCTGAGT 350 

CAAGAAGGTC 4 00 

GCTTCAACGG 4 50 

AAGGGATGGA 500 

CCTCCTTGAG 550 

AGCCTCTCCG 600 

ACAGTACCAG 650 

TGTCACTTTY 700 

TGCACCACCA 7 50 

AACGTCAAGA 800 

TGATTCCAAG 8 50 

TCATCGTCCT 900 

CTTGACTGCC 950 

GAAGATTGAC 1000 

TCAAGTCTGG 1050 

TGCGTTGAGG 1100 
1105 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271 

5 

GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA AGTTCCAGAC 50 

CCCTAAGTAT GAGGTCACCG TCATTGGTAT GTTGTCCTTC TTGTGTTACC 100 

• ATCGAAACAT ATCTAACCTA CAACTGCAGA CGCCCCCGGT CACCGTGACT 150 

TCATCAAGAA CATGATCACT GGTACCTCCC AGGCCGACTG CGCTATTCTC 200 

10 ATCATTGCTT CCGGTACTGG TGAATTCGAG GCTGGTATCT CCAAGGATGG 2 50 

CCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTCGGT GTCCGTCAGC 300 

TCATCGTTGC CCTCAACAAG ATGGACACTG CTGGCTGGGC TGAGGCTCGT 350 

T AC AAC G AAA TCGTCAAGGA AACTTCCGGT TTCATCAAGA AGGTCGGCTA 400 

CAACCCCAAG TCGGTTCCCT TCGTCCCCAT CTCCGGTTTC AACGGTGACA 4 50 

15 ACATGCTTGA GCCCTCCTCC AACTGCCCCT GGTACAAGGG TTGGGAGAAG 500 

GAGACCAAGG CTGGTAAGGC CACTGGTAAG ACCCTCCTCG AGGCCATCGA 550 

CGCCATTGAG CCTCCCGTCC GTCCCTCCAA CAAGCCTCTC CGTCTTCCCC 600 

TCCAGGATGT CTACAAGATC TCTGGTATTG GAACTGTCCC CGTCGGCCGT 650 

GTCGAGACCG GTACCATCGT CCCCGGTATG GTCGTCACCT TCGCTCCCGC 700 

20 CAACGTCACC ACTGAAGTCA AGTCCGTTGA GATGCACCAC CAGCAGCTCA 7 50 

AGGAGGGTGT TCCCGGTKAC AACGTTGGTT TCAACGTGAA GAACGTTTCC 8 00 

GTCAAGGAAG TCCGCCGTGG TAACGTCGCT GGTGACTCCA AGAACGACCC 850 

CCCTGCCGGT GCTGCCTCTT TCACCGCCCA GGTCATCGTC CTCAACCACC 900 

CCGGTCAGGT CGGCGCTGGT TACGCTCCCG TCCTCGACTG CCACACCGCT 950 

25 CACATTGCCT GCAAGTTCGC TGAGCTCCAG GAGAAGATCG ACCGCCGTAC 1000 

CGGAAAGTCT GTCGAATYTG CCCCCAAGTT CATCAAGTCT GGTGACGCCG 1050 

CTATCGTCAA GATGATTCCC TCCAAGCCCA TGTGTGTCGA GTCTTTCACT 1100 

GACTACCCTC CTYTCGGCCG TTTCGCCGTC CGTGACGTAA GTTCTTTCCC 1150 

CAGCTTTTCG ATGCTACCCT TCTMTGAATC ACGTGTCATG TCTTGGCACC 1200 

3 0 CGCCCATCAC ATGACCACGC AACCCTATAC CCCGCCACAC CCTT 1244 



2) INFORMATION FOR SEQ ID NO: 1272 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 {A) ORGANISM: Exophiala moniliae 

(B) STRAIN: WSA-219 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272 

50 GCTGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACTATGTCA CCGTCATCGA CGCCCCCGGT CATCGTGACT 100 

TCATCAAGAA CATGATCACT GGTACTTCCC AAGCTGACTG CGCCATTCTC 150 

ATCATTGCTG CCGGTACTGG TGAATTCGAA GCCGGTATCT CCAAGGATGG 200 
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TCAGACCCGT GAGCACGCTC TGCTTGCCTA 
TCATTGTCGC CATCAACAAG ATGGACACTA 
TTCAACGAAA TCATCAAGGA GACTTCCAGC 
CAACCCCAAG TCCGTTCCTT TCGTCCCCAT 
5 ACATGATCGA CGTCTCCACC AACTGCCCCT 
GAGACCAAGG CTGGCAAGGC CTCTGGCAAG 
CGCCATTGAC CCCCCCTCTC GTCCCACCGA 
TCCAGGATGT GTACAAGATC TCTGGTATCG 
GTCGAGACTG GTATCATCAA GGCCGGTATG 

10 CAACGTCACC ACTGAAGTCA AGTCCGTCGA 
CCGAGGGTGT TCCAGGTGAC AACGTCGGTT 
GTCAAGGAGG TTCGTCGTGG AAACGTTTGC 
ACCCAAGGGC GCTGATTCCT TCAACGCCCA 
CTGGTCAAGT TGGTGCTGGC TACGCCCCAG 

15 CACATTGCTT GCAAGTTCTC TGAGCTTCTC 
CGGAAAGTCG ATCGAAAACA ACCCCAAGTT 
CCATCGTCAA GATGGTTCCC AGCAAGCCCA 



PCT/CA00/01150 

CACCCTGGGT GTCAAGCAGC 250 
CCAAGTGGTC TGAGGACCGT 300 
TTCATCAAGA AGGTCGGCTA 350 
CTCCGGCTTC AACGGTGACA 400 
GGTACAAGGG CTGGGAGAAG 4 50 

ACTCTCCTTG AGGCCATCGA 500 
CAAGCCTYTC CGTCTCCCTC 550 
GAACGGTGCC CGTCGGTCGT 600 
GTCGTTACCT TCGCTCCTGC 650 
AATGCACCAC GAACAGCTCG 700 
TCAACGTCAA GAACGTYTCC 750 
GGTGACTCCA AGAACGACCC 800 
GGTCATCGTC TTGAACCACC 850 
TGTTGGATTG CCACACTGCC 900 
GAGAAGATCG ATCGTCGKAC 950 

CATCAAGTCT GGTGACGCTG 1000 

TG 1032 



20 



2) INFORMATION FOR SEQ ID NO: 1273 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hortaea werneckii 

(B) STRAIN: ATCC 34944 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273 

TAACAGCCCG TGCGACCGTC TCGTCCAATA ACTAATGATT CGCAGGTACT 50 

ATGTCGTGAG TATCCGGTCC TTTTTTGTTA ATTTACCAGA AATGACGAAA 100 

TTTTTGACTA ATTAACACAC TCAGACCGTC ATTGACGCCC CGGGTCACCG 150 

40 TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCC GACTGCGCTG 200 

TCCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA 300 

GCAGCTCATC GTCGCCATCA ATAAGATGGA. CACCACCAAG TGGTCCGAGG 350 

AGCGTTACGG CGAGATCATC AAGGAGACCT CTGCCTTCAT CAAGAAGGTC 4 00 

45 GGTTTCAACC CGAAGCACGT CCCGTTCGTC CCGATCTCCG GTTTCAACGG 4 50 

TGACAACATG ATCGAGGCCT YCACCAACTG CCCGTGGTAC AAGGGCTGGG 500 

AGAAGGAGAC CAAGGCCAAG GTCACCGGCA AGACCCTYCT TGAGGCCATT 550 

GACAACATCG ACCCGCCGAG CCGTCCTTCC GACAAGCCGC TCCGTCTTCC 600 

CCTCCAGGAT GTCTACAAGA TCGGTGGTAT TGGGACAGTC CCAGTCGGCC 650 

50 GTGTCGAGAC CGGTACCATC AAGGCCGGCA T GGTCGTT AC CTTCGCTCCG 700 

GCTGGTGTCA CCACTGAAGT GAAGTCCGTT GARATGCACC ACGAGCAGCT 750 

CGCTGAGGGT YTGCCGGGTG ACAACGTCGG CTTCAACGTC AAGAACGTTT 800 

CCGTCAAGGA GATCCGTCGT GGCAACGTTG CTGGTGACAG CAAGGCTGAC 850 
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CCGCCGAAGG GCTGCGACAG CTTCAACGCC 
CCCTGGCCAG GTCGGTGCTG GTTACGCTCC 
CCCACATTGC CTGCAAGTTC GGCGAGCTCC 
TYTGGCAAGT CCATTGAAGC CTYGCCTAAG 
5 TGCCATYGTC AAGATGATTC CGTCCAAGCC 
CTGAGT 



PCT/CA00/01150 

CAGGTCATCG TCCTGAACCA 900 
AGTCCTGGAC TGCCACACTG 950 
TCGAGAAGAT CGACCGTCGC 1000 
TACATCAAGT CTGGTGACGC 1050 
GATGTGCGTT GAGCCATTCA 1100 

1106 



10 2) INFORMATION FOR SEQ ID NO: 1274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274 



CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCCGCTACT ATGTCACCGT CATTGGTATG TTGCTGTCAC 100 

CTCTCTCACA CATGTCTCAC CACTAACAAT CAACAGACGC CCCCGGCCAC 150 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGACTGCGC 200 

3 0 CATTCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 250 

AGGATGGCCA GACCCGTGAG CACGCCCTGC TCGCCTACAC CCTCGGTGTC 300 

AAGAACCTCA TTGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 350 

GTCCCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC AT CAAG AAGG 4 00 

TCGGCTACAA CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC 4 50 

35 GGCGACAACA TGCTTACTCC CTCCACCAAC TGCCCCTGGT ACAAGGGCTG 500 

GGAGCGTGAG ATCAAGTCCG GCAAGCTCAC TGGCAAGACC CTCCTCGAGG 550 

CCATTGACTC CATCGAGCCC CCCAAGCGTC CCGTCGACAA GCCCCTYCGA 600 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATTGGCA CGGTTCCCGT 650 

CGGCCGTATC GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTMG 7 00 

4 0 CCCCCTCCAA CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG 750 

CAGCTCTYTG AGGGTCTTCC CGGTGACAAC GTCGGCTTCA ACGTGAAGAA 800 

CGTYTCCGTC AAGGAGATCC GACGTGGCAA CGTCGCTGGT GACTCCAAGA 850 

ACGACCCCCC TYTGGGTGCC GCCTCTTTCA CCGCCCAGGT CATTGTCCTC 900 

AACCACCCTG GCCAGGTCGG TGCCGGTTAC GCCCCCGTTY TGGACTGCCA 950 

4 5 CACTGCCCAC ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC 1000 

GCCGAACTGG TAAGGCTGTT GAGTCCGCCC CCAAGTTCAT CAAGTCTGGT 1050 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1100 

TTTCACTGAC TACCCCCCT 1119 



2) INFORMATION FOR SEQ ID NO: 1275 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 5 

15 CTGAAGTCTG AGCGTGAGCG TGGTATCACT ATCGATATCG CCCTCTGGAA 50 

GTTCGAGACC CCCAAGTACA TGGTCACCGT CATCGATGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC TGACTGCGCC 150 

ATTCTCATCA TTGCTGCCGG TACTGGTGAG TTCGAGGCTG GTATCTCCAA 200 

GGATGGCCAG ACTCGTGAGC ACGCTCTCCT CGCCTACACC CTTGGTGTCA 2 50 

2 0 AGCAGCTCAT CGTTGCCATC AACAAGATGG ACACCACCAA GTGGTCTGAG 300 

GCCCGTTACC AGGAGATCAT CAAGGAGACC TCCGGTTTCA TCAAGAAGGT 350 

CGGCTACAAC CCCAAGCACG TTCCCTTCGT CCCCATCTCC GGCTTCAACG 4 00 

GAGACAACAT GATCGAGGTC AGCTCCAACT GCCCTTGGTA CAAGGGTTGG 4 50 

GAGAAGGAGA CCAAGGCCAA GGCCACTGGC AAGACCCTCC TCGAGGCCAT 500 

25 TGACGCCATC GACCCTCCTT CCCGTCCTAC CGACAAGCCC CTCCGTCTTC 550 

CCCTCCAGGA TGTCTACAAG ATCGGTGGTA TTGGCACGGT GCCCGTCGGT 600 

CGTGTTGAGA CCGGTAAGAT CATGGGTGGT ATGGTTGTCA CCTTCGCCCC 650 

CGCTGGTGTC ACCACCGAGG TCAAGTCCGT CGAGATGCAC CACGAGCAGC 700 

• TCACCGAGGG TCTT.CCCGGT GACAACGTCG GCTTCAACGT CAAGAACGTY 7 50 

30 TCCGTCAAGG AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA 800 

CCCCCCCAAG GGTTGTGACT CCTTCAACGC CCAGGTCATC GTCCTGAACC 850 

ACCCTGGTCA GGTCGGTGCT GGTTACGCAC CCGTCCTCGA CTGCCACACT 900 

GCCCACATTG CTTGCAAGTT YTCCGAGCTT GTTGAGAAGA TTGACCGCAG 950 

AACCGGCAAG TCCGTTGAGG CTGCCCCCAA GTTCATCAAG TCTGGTGACG 1000 

35 CCGCCATCGT CAAGATGGTT CCCTCCAAGC CCATGTGTGT TGAGGCTTTC 1050 

ACCGACTACC CTCCTYTCGG TCGTTTCGCC GTCCGGTATG TTTTTCTTCT 1100 

TCAATCATTC TTC 1113 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1276 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 731 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidls 

(B) STRAIN: ATCC 14112 

691 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276 

GGTCTGAGAC CCGTTTCAAC GAAATTATCA AGGAAGTCAG CAACTTCATC 50 

5 AAGAAGGTCG GATACAACCC CAAGTCTGTT CCCTTCGTGC CAATCTCCGG 100 

TTTCGAGGGT GACAACATGA TTGAGCCTTC CCCCAACTGC CCTTGGTACA 150 

AGGGCTGGAA CAAGGAGACT GCTGCTGGCA AGGCCGCCGG TAAGACTCTT 200 

CTCGATGCCA TTGACGCCAT CGACCCCCCC GTCCGTCCTA CCGAGAAGCC 250 

TCTCCGTCTT CCCCTCCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 300 

10 TTCCCGTTGG ACGTGTCGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTG 350 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TTGAAATGCA 400 

CCACCAGCAG CTCCAGGCTG GTTACCCTGG TGACAACGTC GGCTTCAACG 450 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCGGTGAC 500 

TCCAAGAACG ACCCCCCCAA GGGCTGCGAG TCCTTCAACG CCCAGGTCAT 550 

15 CGTCCTCAAC CACCCCGGCC AGGTTGGCGC TGGTTATGCC CCAGTCCTCG 600 

ACTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT CATCGAGAAG 650 

ATTGACCGCC GTACCGGAAA GTCTGTTGAG GACAACCCCA AGTTCATCAA 700 

GTCCGGTGAT GCTGCTATCG TCAAGATGAT T 731 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 1277 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 104 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala dermatitidis 

(B) STRAIN: ATCC 7 6088 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277 



CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCCCTCTG GAAGTTCGAG 50 

ACCCCCAAGT ACTATGTCAC CGTCATCGAC GCCCCGGGTC ATCGTGACTT 100 

40 TATCAAGAAC ATGATCACTG GTACCTCGCA GGCCGACTGC GCCATCTTGA 150 

TCATTGCCGC CGGTACCGGT GAATTCGAAG CCGGTATCTC CAAGGATGGT 200 

CAGACCCGTG AGCACGCTCT GCTCGCCTAC ACCTTGGGTG TCAAGCAGCT 250 

CATCGTCGCC ATCAACAAGA TGGACACCAC CAAGTGGTCC GAGGAGCGTT 300 

TCAACGAAAT CATCAAGGAG ACTTCCAACT TCATCAAGAA GGTCGGCTAC 350 

45 AACCCCAAGG CCGTTCCTTT CGTCCCCATC TCCGGCTTCA ACGGTGACAA 400 

CATGATTGAG GTCTCCACCA ACTGCCCGTG GTACAAGGGA TGGGAGAAGG 450 

AGTCCAAGGC TGGCAAGGCC ACCGGCAAGA CCCTCCTCGA GGCCATTGAC 500 

GCCATCGACC CACCCACCCG TCCCACCGAC AAGCCTCTCC GTCTCCCTCT 550 

CCAGGATGTC TACAAGATCT CTGGTATCGG AACGGTTCCT GTCGGTCGTG 600 

50 TCGAGACCGG TACCATCAAG GCCGGTATGG TCGTCACCTT CGCTCCGGCC 650 

AACGTCACCA CTGAAGTCAA GTCCGTCGAA ATGCACCACG AGCAGCTCGC 700 

CGAGGGTYTG CCAGGTGACA ACGTTGGCTT CAACGTCAAG AACGTYTCCG 750 

TCAAGGAGGT TCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCG 800 

692 
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CCCAAGGGTG CCGAGTCCTT CAACGCCCAG 
TGGTCAGATC GGTGCCGGCT ACGCTCCAGT 
ACATTGCTTG CAAGTTCGCC GAGTTGCTCG 
GGAAAGTCCA TCGAGAACAA CCCCAAGTTC 
5 CATCGTCAAG ATGATTCCCA GCAAGCCCAT 



GTCATTGTCC TCAACCACCC 850 

CTTGGATTGC CACACTGCCC 900 

AGAAGATCGA CCGTCGTACC 950 

ATCAAGTCTG GTGATGCTGC 1000 

GTGTGTCGAG GCTTTC 104 6 



2) INFORMATION FOR SEQ ID NO: 1278 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278 

25 CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT TCGAGACTCC 50 

TCGCTACTAT GTCACCGTCA TTGGTATGTT GTCGCTCATA CCTCATCCTA 100 

CTTCCTCATA CTAACACATC ATTCAGACGC TCCCGGTCAC CGTGATTTCA 150 

T C AAGAAC AT GATCACTGGT ACTTCCCAGG CCGATTGCGC CATTCTCATC 200 

ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA AGGATGGCCA 250 

30 GACCCGTGAG CACGCTCTTC TTGCCTACAC CCTTGGTGTC AAGAACCTCA 300 

TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCTGA GGCCCGTTAC 350 

CAGGAGATCA TCAAGGAGAC CTCCTCTTTC ATCAAGAAGG TCGGCTACAA 400 

CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC GGTGACAACA 450 

TGCTTACCCC CTCCACCAAC TGCCCCTGGT ACAAGGGTTG GGAGCGTGAG 500 

35 ATCAAGTCCG GCAAGCTCTC CGGCAAGACC CTCCTCGAGG CCATTGACTC 550 

CATCGAGCCT CCCAAGCGTC CCGTTGACAA GCCCCTCCGT CTTCCCCTCC 600 

AGGATGTCTA CAAGATCGGT GGTATTGGAA CGGTTCCCGT CGGCCGTATT 650 

GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTCG CTCCCTCCAA 7 00 

CGTCACCACT G AAGT C AAGT CCGTCGARAT GCACCACGAG CAGYTCAGTK 7 50 

40 AGGGCCAGCC CGGTGACAAC GTTGGTTTCA ACGTGAAGAA CGTTTCCGTC 800 

AAGGACATCC GACGTGGTAA CGTCGCTGGT GACTCCAAGA ACGACCCCCC 850 

CCAGGGTGCC GCTTCTTTCA CCGCCCAGGT CATCGTCCTC AACCACCCCG 900 

GCCAGGTCGG TGCTGGTTAC GCTCCTGTCC TCGATTGCCA CACTGCCCAC 950 

ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC GCCGAACCGG 1000 

45 TAAGGCTACT GAGGCCGCTC CCAAGTTCAT CAAGTCTGGT GACTCCGCCA 1050 

TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GTGTCGAGGC TTTCACTGAC 1100 

TACCCTCCT 1109 



50 

2) INFORMATION FOR SEQ ID NO: 1279 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7 65 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 
10 (B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 9 

CCTGCAAGTG GTCTGAGGAC CGTTACAACG AAATCGTGAA GGAGACCTCC 50 

15 AACTTCATCA AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC 100 

CATCTCCGGT TTCAACGGTG ACAACATGCT TGAGCCTTCC CCCAACTGCC 150 

CCTGGTACAA GGGTTGGGAG AAGGAGGGCA AGTCCGGCAA GGTCACCGGT 200 

AAGACTCTCC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCTGC 250 

CAACAAGCCT CTCCGTCTTC CCCTCCAGGA TGTGTACAAG ATCTCTGGTA 300 

20 TCGGAACTGT CCCCGTCGGC CGTGTCGAGA CTGGTGTCAT CACCCCCGGC 350 

ATGGTTGTTA CCTTCGCTCC TTCCAACGTC ACCACTGAAG TGAAGTCCGT 400 

TGAGATGCAC CACCAGCAGC TCAAGGAGGG TCTCCCCGGT GACAACGTTG 4 50 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AGGTCCGTCG TGGTAACGTC 500 

GCTGGTGACT CCAAGAACGA CCCCCCTGCT GGCGCTGCCT CCTTCACCGC 550 

25 CCAGGTCATC GTTCTCAACC ACCCCGGTCA GGTCGGCGCT GGCTACGCCC 600 

CCGTCCTCGA CTGCCACACT GCCCACATTG CCTGCAAGTT CGCTGAGCTC 650 

CAGGAGAAGA TTGACCGCCG TACCGGAAAG TCTGTTGAGT CTTCTCCCAA 700 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATC CCTYCAAAGC 750 

CCATGTGCGT CGAAG 765 

30 



2) INFORMATION FOR SEQ ID NO: 1280 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
45 (B) STRAIN: ATCC 6474 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280 

GCGTGGTATC ACCATCGACA TTGCCCTCTG GAAGTTCCAG ACTCCCAAGT 50 

50 AT GAGGT CAC TGTCATCGGT AAGCTCGACT CGCCCCGATA TGTTTTGGTG 100 

CTGTAGCTAA CACGATCTGA AGATGCCCCC GGTCACCGTG ACTT CATC AA 150 

GAACATGATC ACTGGTACCT CCCAGGCTGA CTGCGCTATC CTCATCATTG 200 

CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACC 250 

694 
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CGTGAGCACG CTCTGCTGGC TTTCACCCTC GGTGTCAAGC AGCTCATCGT 300 

CGCCCTCAAC AAGATGGACA CCTGCAAGTG GTCCGAGGAT CGTTACAACG 350 

AAATTGTCAA GGAAACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 400 

AAGGCCGTTC CCTTCGTMCC CATCKCTGGC TTCAACGGTG AC AACAT GCT 450 

5 TGAGCCCTCC TCCAACTGCC CCTGGTACAA GGGATGGGAG AAGGAGACCA 500 

AGGCCGGCAA GGTCACTGGT AAGACCCTCA TCGAGGCCAT YGACGCCATT 550 

GAGCCCCCTG TCCGTCCCTC CAACAAGCCC CTYCGTCTTC CCCTCCAGGA 600 

TGTYTACAAG ATCTCTGGTA TCGGAACGGT CCCTGTCGGC CGTGTCGAGA 650 

CCGGTATCAT CAAGCCCGGC ATGGTCGTCA CCTTTGCCCC CGCCAACGTC 700 

10 ACCACTGAAG TCAAGTCCGT CGAAATGCAC CACCAGCAGC TCCAGGAGGG 750 

TGTCCCCGGT GACAACGTCG GTTTCAACGT CAAGAACGTT TCCGTCAAGG 800 

AAGTCCSCCG TGGTAACGTT TGCGGTGACT CCAAGAACGA TCCCCCTCAG 850 

GGTGCTGCCT CCTTCAACGC CCAGGTCATC GTCCTCAACC ACCCCGGTCA 900 

GGTCGGCGCT GGTTACGCCC CCGTCCTCGA CTGCCACACT GCCCACATTG 950 

15 CTTGCAAGTT CTYTGAGCTG CTTGAGAAGA TTGACCGCCG TACCGGCAAG 1000 

TCTGTTGAGA ACAACCCCAA GTTCATCAAG TCCGGTGAWG CCGCCATCGT 1050 

GAAGATGGTT CCTTCCAAGC CCATGTGTGT CGAGTCCTTC ACTGACTACC 1100 

CCCCT 1105 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1281 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 134 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus laurentii 

(B) STRAIN: ATCC 44096 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281 



GCCGAGCGAG AGCGAGGTAT CACCATCGAC ATTGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAACGTCA CCGTCATTGA CGCCCCCGGA CACCGAGACT 100 

4 0 TCATCAAGAA CATGATCACC GGTACCTCCC AGGCCGACTG CGCCATCCTT 150 

ATCATTGCCA CCGGTATCGG AGAGTTCGAG GCCGGTATCT CTAAGGACGG 200 

TCAGACCCGA GAGCACGCTT TGCTCGCCTT CACCCTCGGT GTCAGGCAGC 250 

TCATCGTTGC TTGCAACAAG ATGGACACCT GCAAGTGGTC CGAGGACCGA 300 

TTCAACGAGA TCGTCAAGGA GACCAACGGT TTCATCAAGA AGGTCGGATA 350 

45 CAACCCCAAG GCTGTTCCCT TCGTCCCCAT CTCTGGATGG CACGGAGACA 400 

AGTGAGTGCC GTTCTTTGCG TTGAGCCCTC TTTGTCGCTC CCCCTCCCTC 4 50 

TCAAGTGGCG GCGGCGGTCT CCACCCACAA ATCGGGTGGC GAATCCGCCA 500 

CACCCACCAC TTCTCGCCAC CGAGTGTGGC ACTTCTTCCA ACTCCTCTTT 550 

CCACTCCTCC TCGTCTCGTC TCTTTTTTTC TCCGTTGTCT TTGACAAGGG 600 

50 GAGTGTGCTG ATAGTAAAGC ATGCTTGAGG AGACCACCAA CATGCCGTGG 650 

TACAAGGGAT GGACCAAGGA GACCAAGTCC GGTGTCGTTA AGGGTAAGAC 700 

CCTCCTCGAC GCCATCGACG CCATCGAGCC TCCTCAACGA CCCACCGACA 750 

AGCCCCTCCG ACTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATCGGT 800 

695 
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ACGGTGCCCG TCGGCCGAGT CGAGACCGGT GTCATCAAGG CCGGGTAAGT 8 50 

CACGGGAGCC TGACGGCTGT TGTTGCCACA CCCAACTTAT ATCCAGTATG 900 

GTCGTCACCT TCGCTCCTAC CAACGTCACC ACTGAGGTTA AGTCCGTTGA 950 

GATGCACCAC GAGCAGATCC CTGAGGGTCT TCCCGGAGAC AACGTTGGTT 1000 

5 TCAACGTGAA GAACGTTTCC ATCAAGGACA TCCGACGAGG AAACGTYTGC 1050 

TCCGACTCCA AGAACGACCC CGCTAAGGAG GCCGCTTCTT TCAACGCCCA 1100 

GGTCATTGTC CTCAACCACC CTGGACAGAT TGGTGCCGGT TACACCCCCG 1150 

TCCTCGACTG CCACACCGCC CACATTGCCT GCAAGTTCGC CGAGCTCATC 1200 

GAGAAGATCG ACCGACGAAC TGGTAAGACC ATGGAGGCCG CCCCCAAGTT 1250 

10 CGTCAAGTCC GGAGACGCCG CCATTGTCAA GCTCGTTGCC CAGAAGCCCA 1300 

TCTGTGTCGA GTCTTACTCT GACTACCCTC CCCTTGGACG ATT 134 3 



15 2) INFORMATION FOR SEQ ID NO: 1282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia parva 

(B) STRAIN: ATCC 10784 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282 



TGGTCCGAGG CTCGTTTCAA CGAAATCATC AAGGAAGTCA CCAACTTCAT 50 

CAAGAAGGTC GGATACAACC CCAAGTCTGT TCCCTTCGTG CCAATCTCTG 100 

GTTTCGAGGG TGATAACATG ATTGAGCCCT CCACCAACTG CCCCTGGTAC 150 

AAGGGCTGGA CCAAGGAGAC CGCTGCCGGC AAGTCAACCG GTAAGACTCT 200 

35 TCTTGACGCC ATTGATGCCA TCGATCAACC CTCCCGTCCT ACCGACAAGC 250 

CCCTCCGTCT TCCCCTCCAG GATGTGTACA AGATCTCCGG TATTGGCACT 300 

GTTCCCGTTG GACGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTCGT 350 

GACCTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 4 00 

ACCACCAACA GCTCTTGGCT GGTAACCCCG GTGACAACGT CGGTTTCAAC 450 

40 GTCAAGAACG TTTCCGTCAA GGAAGTCCGC CGTGGCAACG TTGCTGGTGA 500 

CTCAAAGAAC GACCCCCCCA AGGGCTGCGA CTCCTTCAAC GCCCAGGTCA 550 

TCGTCCTCAA TCACCCCGGT CAAGTTGGCG CTGGTTATGC CCCAGTCCTC 600 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TCCTCGAGAA 650 

GATTGACCGC CGTACCGGAA AGTCCACTGA GAACAACCCC AAGTTCATCA 700 

45 AGTCTGGTGA CGCCGCTATC GTCAAGATGG TTCC 734 



2) INFORMATION FOR SEQ ID NO: 1283 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 62877 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1283 

CGTGAGCGTG GTATCACCAT CGACATTGCC CTCTGGAAGT TCGAGACTCC 50 

CCGCTACTAT GTCACCGTCA TTGGTATGTT GCTGTCACCT CTCTCACACA 100 

TGTCTCACCA CTAACAATCA ACAGACGCCC CCGGCCACCG TGACTTCATC 150 

15 AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCCA TTCTCATCAT 200 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 250 

CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GAACCTCATT 300 

GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTACCA 350 

GGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 400 

20 CCAAGGCTGT CGCTTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 450 

CTTACTCCCT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGCGTGAGAT 500 

CAAGTCCGGC AAGCTCACTG GCAAGACCCT CCTCGAGGCC ATTGACTCCA 550 

TCGAGCCCCC CAAGCGTCCC GTCGACAAGC CCCTCCGACT TCCCCTCCAG 600 

GATGTCTACA AGATCGGTGG TATTGGCACG GTTCCCGTCG GCCGTATCGA 650 

25 GACTGGTGTC ATCAAGCCCG GTATGGTCGT TACCTTCGCC CCCTCCAACG 700 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTYTGAG 750 

GGTCTTCCCG GTGACAACGT CGGCTTCAAC GTKAAGAACG TYTCCGTCAA 800 

GGAGATCCGA CGTGGCAACG TCGCTGGTGA CTCCAAGAAC GACCCCCCTY 850 

TGGGTGCCGC CTCTTTCACC GCCCAGGTCA TTGTCCTCAA CCACCCTGGC 900 

30 CAGGTCGGTG CCGGTTACGC CCCCGTTYTG GACTGCCACA CTGCCCACAT 950 

TGCCTGCAAG TTCGCCGAGA TCCAGGAGAA GATCGACCGC CGAACTGGTA 1000 

AGGCTGTTGA GTCCGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1050 

GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGACTA 1100 

CCCCCCT 1107 

35 



2) INFORMATION FOR SEQ ID NO: 1284 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckxi 
50 (B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284 



697 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCTCAAGGCC GAGCGTGAGC GCGGTATCAC CATCGATATT GCTCTGTGGA 50 

AGTTCGAGAC CCCCAAGTAC TACGTCACCG TCATTGACGC CCCCGGTCAT 100 

CGCGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

5 AGGATGGCCA GACTCGTGAG CACGCTCTGC TCGCCTACAC CCTGGGTGTG 250 

CGGCAGCTGA TCGTCGCCAT CAACAAGATG GACACGGCCA AGTGGGCTGA 300 

GGCTCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGACT GTTGCCTTCG TCCCCATCTC GGGCTTCCAC 400 

GGCGACAACA TGCTTACTCC CTCGACCAAC TGCCCCTGGT ACAAGGGCTG 4 50 

10 GGAGAAGGAG GGCAAGAGCG GCAAGGTTAC CGGTAAGACT CTGCTGGACG 500 

CCATTGACGC CGTCGAGCCC CCCAAGCGCC CCACGGACAA GCCCCTGCGT 550 

CTGCCCCTCC AGGATGTCTA CAAGATCGGC GGTATCGGCA CTGTCCCTGT 600 

CGGCCGTATC GAGACTGGTG TCCTGAAGCC CGGCATGGTC GTCACCTTTG 650 

CCCCGTCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 7 00 

15 CAGCTTGTTG AGGGTGTTCC CGGCGACAAC GTCGGCTTCA ACGTCAAGAA 750 

CGTYTCCGTC AAGGAGATCC GTYGTGGCAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCCCC CTCGGGCGCC GCCACCTTCA ACGCCCAGGT CATTGTCCTG 850 

AACCACCCCG GCCAGGTCGG CAACGGCTAC GCCCCGGTTY TGGACTGCCA 900 

CACCGCCCAC ATTGCCTGCA AGTTCACCGA GATCCTTGAG AAGATCGACC 950 

2 0 GCCGTACCGG CAAGTCGGTT GAGAACAACC CCAAGTTCAT CAAGTCGGGT 1000 

GACGCCGGCA TTGTCAAGCT GACGCCCTYG AAGCCCATGT GCGTT 1045 



25 2) INFORMATION FOR SEQ ID NO: 1285 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285 



TTGCCAGTGG TCCGAGGCCC GTTACAACGA AATCGTCAAG GAGACTTCCG 50 

GTTTCATCAA GAAGGTCGGA TACAACCCCA AGTCCGTTGC CTTCGTCCCC 100 

ATCTCCGGTT TCAACGGTGA CAACATGCTC GAGGCCTCTA CCAACTGCCC 150 

CTGGTACAAG GGTTGGGAGA AGGAGACCAA GGCCGGTAAG GCCACTGGTA 200 

45 AGACCCTCCT TGAGGCCATC GACGCCATTG AGCCCCCCAC CCGTCCCTCC 250 

AACAAGCCCC TCCGTCTTCC CCTCCAGGAT GTCTACAAGA TCTCCGGTAT 300 

TGGAACTGTC CCCGTCGGCC GTGTCGAGAC TGGTGTTATC ACCCCCGGCA 350 

TGGTCGTCAC CTTCGCTCCT GCCAACGTCA CCACTGAAGT CAAGTCGGTT 4 00 

GAGATGCACC ACCAGCAGCT CAAGGAGGGT GTCCCCGGTG ACAACGTCGG 4 50 

50 TTTCAACGTC AAGAACGTTT CCGTCAAGGA AATCCGTCGT GGTAACGTTG 500 

CCTCCGACTC CAAGAACGAC CCCGCCTCCG GCGCTGCCTC TTTCAACGCC 550 

CAGGTCATCG TTCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC 600 

CGTCCTCGAC TGCCACACCG CCCACATTGC TTGCAAGTTC TCTGAGCTTC 650 
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TTGAGAAGAT TGACCGCCGT ACCGGAAAGG CTGTTGAAAC CAGCCCCAAG 7 00 

TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC CTTCCAAGCC 750 
CATGTGCGKT CCGA 7 64 



2) INFORMATION FOR SEQ ID NO: 1286 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 971 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 16264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286 



20 



ACCATCGATA TCGCGCTCTG GAAGTTCGAG ACTCCCAAGT ACTTCGTCAC 50 

CGTCATCGAT GCCCCTGGTC ATCGTGACTT CATCAAGAAC ATGATCACTG 100 

25 GTACCTCCCA GGCTGATTGT GCTATTCTCA TCATTGCCGC TGGTACTGGT 150 

GAGTTCGAGG CCGGTATCTC CAAGGATGGC CAGACCCGTG AGCATGCTCT 200 

GCTCGCCTAC ACCCTGGGCG TGAAGCAGCT TATCGTCGCC ATCAACAAGA 250 

TGGACACCAC CAAATGGTCT GAGGATCGTT TCAACGAAAT CATCAAGGAG 300 

ACTTCCAACT TCATCAAGAA GGTCGGATAC AACCCCAAGT CCGTTCCATT 350 

30 CGTGCCCATC TCCGGCTTCA ACGGTGACAA CATGATCGAC GTCTCCACCA 400 

ATGCCCCCTG GTACAAGGGC TGGGAAAAGG AGTCCAAGGC TGGCAAGGCC 450 

ACCGGCAAGA CCCTCCTTGA GGCTATCGAC TCCATCGACC CTCCTGCTCG 500 

TCCCACCGAC AAGCCTCTCC GTCTCCCACT CCAGGATGTC TACAAGATTT 550 

CTGGTATCGG CACGGTGCCC GTCGGTCGTG TTGAGACTGG TACCATCAAG 600 

35 GCCGGTATGG TCGTCACCTT TGCCCCCGCC AACGTCACCA CTGAAGTCAA 650 

GTCCGTCGAA ATGCACCACG AACAGCTYGC CGAGGGCGTT CCGGGTGACA 700 

ACGTCGGCTT CAACGTCAAG AACGTYTCCG TGAAGGAGGT TCGTCGTGGA 7 50 

AACGTTGCTG GTGACTCCAA GAACGACCCC CCCAAGGGTG CCGACTCCTT 800 

CAACGCCCAG GTCATCGTCC TCAACCACCC TGGTCAGGTC GGTGCTGGCT 850 

40 ACGCCCCGGT CTTGGATTGC CACACTGCCC ACATTGCCTG CAAGTTCTYT 900 

GAGCTCCTCG AGAAGATCGA TCGTCGKACC GGCAAGTCCA TGGAAAACAA 950 

CCCCAAGTTC ATCAAGTCTG G 971 



45 



2) INFORMATION FOR SEQ ID NO: 1287 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 32 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



699 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 
5 (B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287 



» GGTCTGAGGA CCGTTACCAG GAGATCATCA AG GAGACCT C CAACTTCATC 50 

10 AAGAAGGTCG GCTACAACCC CAAGCACGTT CCCTTCGTCC CCATCTCCGG 100 

TTTCAACGGA GACAACATGA TCGAGGCCTC CAGCAACTGC CCCTGGTACA 150 

AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACTGGTAA GACCCTCCTT 200 

GAGGCCATTG ACGCCATCGA CCCTCCCAGC CGTCCTACCG ACAAGCCCCT 250 

CCGTCTTCCC CTCCAGGATG TCTACAAGAT TGGTGGTATT GGCACGGTTC 300 

15 CCGTCGGTCG TGTCGAGACC GGTATCATCA AGGCCGGTAT GGTCGTCACC 350 

TTCGCCCCCG CTGGTGTCAC CACTGAAGTC AAGTCCGTCG AGATGCACCA 4 00 

CGAGCAGCTT ACCGAGGGTG TCCCCGGTGA CAACGTCGGC TTCAACGTCA 4 50 

AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GTAACGTTGC CGGTGACTCC 500 

AAGAACGACC CCCCCAAGGG CTGCGAGTCT TTCAACGCTC AGGTCATTGT 550 

2 0 CCTCAACCAC CCTGGTCAGG TCGGTGCCGG TTACGCGCCA GTCCTCGACT 600 

GCCACACCGC CCACATTGCC TGCAAGTTCT CTGAGCTCCT CGAGAAGATT 650 

GACCGCCGTA CCGGAAAGTC TGTCGAAGCC TCTCCCAAGT TCATCAAGTC 700 

TGGTGACGCG GCCATCGTCA AGATGGTTCC CT 7 32 
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2) INFORMATION FOR SEQ ID NO: 1288 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) ' LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288 



AATGGATCCT GTATACGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG G ATT GGAAAT CGCGGAAGCA 100 

45 CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC T CACACGT AG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARRARCAAT WGCAATCTTT ATTAACCAAA TTCGWGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCCA 337 

50 



2) INFORMATION FOR SEQ ID NO: 1289 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 



15 



25 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289 



AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCT GAT AC A GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

20 GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARAARCAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCC 336 



2) INFORMATION FOR SEQ ID NO: 1290 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1290 
GAYTAYGCIA TGISIGTIAT HGT 23 

2 ) INFORMATION FOR SEQ ID NO: 1291 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291 
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2) INFORMATION FOR SEQ ID NO: 1292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1292 
ARISCYTCIA RIATRTGIGC 



2) INFORMATION FOR SEQ ID NO: 1293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293 
ATGGCTGAAT TACCTCAATC 



2) INFORMATION FOR SEQ ID NO: 1294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1294 
ATGATTGTTG TATATCTTCT TCAAC 



2) INFORMATION FOR SEQ ID NO: 1295 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1295 
CAGAAAGTTT GAAGCGTTGT 



2) INFORMATION FOR SEQ ID NO: 12 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296 
AACGATTCGT GAGTCAGATA 



2) INFORMATION FOR SEQ ID NO: 1297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 97 
CGGTCAACAT TGAGGAAGAG CT 



2) INFORMATION FOR SEQ ID NO: 1298 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1298 
ACGAAATCGA CCGTCTCTTT TTC 



23 



10 



15 



20 



25 



30 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1299 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2711 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: 601055 

(C) ACCESSION NUMBER: X71437 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1299 



ATGGCTGAAT 
AATGCGTGAA 
CATTGCCAGA 
TATGGATTAA 
AGCACGTATC 
CATCTATTTA 
TATCCGCTTG 
CGCAGCAGCA 
AACTGTTACG 
GATGGTAATG 
ATTAGCCAAT 
CACCACATAA 
AACCCTGATA 
TTTCCCAACT 
ATGAAACAGG 
GAACGTGGAG 
AGTGAATAAG 
AGAAAATTGA 
GGTGTGCGTG 
TTTAAATAAC 
ATATGATTGC 
GCGTTGGTAC 
GCAATATAAC 
TACGTATCGC 
TCAGATACAG 
TTCTGAAAAA 



TACCTCAATC 
TCATTTTTAG 
TGTTCGTGAC 
AT GAACAAGG 
GTTGGTGACG 
TGAAGCAATG 
TTGATGGCCA 
ATGCGTTATA 
TGATATTAAT 
AAAGAGAGCC 
GGTGCATCAG 
CTTAACAGAA 
TTTCAATTGC 
GCTGGACTTA 
TCGTGGTTCA 
GCGGACGTCA 
GCTCGTATGA 
CGGTATCACT 
TCGTTATTGA 
TTATACAAAC 
ACTTGTAAAT 
ATTATTTAGA 
TTACGTAAAG 
ACTTGACCAT 
ATAAAGTTGC 
CAAGCTCAAG 



AAGAATAAAT 
ATTATGCGAT 
GGTTTAAAAC 
TATGACACCG 
TAATGGGTAA 
GTACGTATGG 
AGGTAACTTT 
CTGAAGCGCG 
AAAGATACAA 
GTCAGTCTTA 
GTATCGCGGT 
TTAATCAATG 
TGAGTTAATG 
TTTTAGGTAA 
ATTCAAATGC 
ACGTATTGTT 
TTGAAAAAAT 
GATTTACGTG 
TGTGCGTAAG 
AAACACCTCT 
GGTAGACCGA 
GCATCAAAAG 
CTAAAGATCG 
ATCGATGAAA 
AATGGAAAGC 
CTATTTTAGA 



GAACGAAATA 
GAGTGTTATC 
CAGTACATCG 
GATAAATCAT 
ATATCACCCT 
CTCAAGATTT 
GGTTCAATGG 
TATGAC TAAA 
TAGATTTTAT 
CCTGCTCGAT 
AGGTATGGCA 
GTGTACTTAG 
GAAGATATTG 
GAGTGGTATT 
GTTCTCGTGC 
GTCACTGAAA 
TGCAGAGCTC 
AT GAAAC AAG 
GATGCAAATG 
TCAAACATCA 
AGCTTATTAA 
ACAGTTGTTA 
TGCCCATATT 
TTATTTCAAC 
TTGCAACAAC 
CATGCGTTTA 



TTACCAGTGA 50 

GTTGCTCGTG 100 

TCGTATACTA 150 

ATAAAAAATC 200 

CATGGTGACT 250 

CAGTTATCGT 300 

ATGGAGATGG 350 

ATCACACTTG 4 00 

CGATAACTAT 4 50 

TCCCTAATTT 500 

ACGAATATTC 550 

CTTAAGTAAG 600 

AAGGTCCTGA 650 

AGACGT GCAT 700 

AGTTATTGAA 750 

TTCCTTTCCA 800 

GTTCGTGACA 850 

TTTACGTACT 900 

CTAGTGTCAT 950 

TTTGGTGTGA 1000 

TTTAAAAGAA 1050 

GAAGAC GTAC 1100 

TTAGAAGGGT 1150 

GATTCGTGAG 1200 

GCTTCAAACT 1250 

AGACGTCTAA 1300 
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CAGGTTTAGA GAGAAACAAA ATTGAAGCTG AATATAATGA GTTATTAAAT 1350 

TATATTAGTG AATTAGAAGC CATCTTAGCT GATGAAGAAG TGTTATTACA 1400 

GTTAGTTAGA GATGAATTGA CTGAAATTAG AGATCGTTTC GGTGATGAGC 1450 

GTCGTACAGA AATTCAATTA GGTGGATTTG AAGACTTAGA GGACGAAGAC 1500 

- 5 TTAATTCCAG AAGAACAAAT AGTAATTACT TTGAGCCATA ATAACTACAT 1550 

TAAACGTTTG CCGGTATCTA CATATCGTGC TCAAAACCGT GGTGGTCGTG 1600 

GTGTTCAAGG TATGAATACA TTGGAAGAAG ATTTTGTCAG TCAATTGGTA 1650 

ACTTTAAGTA CACATGACCA TGTATTGTTC TTTACTAACA AAGGTCGTGT 1700 

ATACAAACTA AAAGGTTATG AAGTGCCTGA GTTATCAAGA CAGTCTAAAG 1750 

10 GTATTCCTGT AGTGAATGCT ATTGAACTTG GAAATGATGA AGTCATTAGT 1800 

ACAATGATTG CTGTTAAAGA CCTTGAAAGT GAAGACAACT TCTTAGTGTT 1850 

TGCAACTAAA CGTGGTGTTG TTAAACGTTC AGCATTAAGT AACTTCTCAA 1900 

GAATAAATAG AAATGGTAAG ATTGCGATTT CGTTCAGAGA AGATGATGAG 1950 

TTAATTGCAG TTCGTTTAAC AAGTGGTCAA GAAGATATCT TGATTGGTAC 2000 

15 ATCACATGCA TCATTAATTC GATTCCCTGA ATCAACATTA CGTCCTTTAG 2050 

GCCGTACAGC AACGGGTGTG AAAGGTATTA CACTTCGTGA AGGTGACGAA 2100 

GTTGTAGGGC TTGATGTAGC TCATGCAAAC AGTGTTGATG AAGTATTAGT 2150 

AGTTACTGAA AATGGTTATG GTAAACGTAC GCCAGTTAAT GACTATCGTT 2200 

TATCAAATCG TGGTGGTAAA GGTATTAAAA CAGCTACGAT TACTGAGCGT 2250 

2 0 AATGGTAATG TTGTATGTAT CACTACAGTA ACTGGTGAAG AAGATTTAAT 2300 

GATTGTTACT AATGCCGGTG TCATTATTCG ACTAGATGTT GCAGATATTT 2350 

CTCAAAATGG TCGTGCAGCA CAAGGTGTTC GCTTAATTCG CTTAGGCGAT 2400 

GATCAATTTG TTTCAACGGT TGCTAAAGTA AAAGAGGATG CAGATGAAGT 2450 

AAATGAAGAT GAACAATCTA CTGTATCTGA AGATGGTACT GAACAACAAC 2500 

25 GTGAAGCGGT TGTAAATGAT GAAACACCAG GAAATGCAAT TCATACTGAA 2550 

GTGATTGATT CAGAAGAAAA TGATGAAGAT GGACGTATTG AAGTAAGACA 2 600 

AGATTTCATG GATCGTGTTG AAGAAGATAT ACAACAATCA TCAGATGAAG 2 650 

ATGAAGAATA A 2711 

30 



2) INFORMATION FOR SEQ ID NO: 1300 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2628 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

45 (C) ACCESSION NUMBER: X57174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300 

ATGAGCGACC TTGCGAGAGA AATTACACCG GTCAACATTG AGGAAGAGCT 50 

50 GAAGAGCTCC TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 100 

TGCCAGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 150 

GCCATGAACG TACTAGGCAA TGACTGGAAC AAAGCCTATA AAAAATCTGC 200 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCATCCCCAT GGTGACTCGG 250 
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CGGTCTATGA CACGATCGTC CGCATGGCGC AGCCATTCTC GCTGCGTTAT 300 
ATGCTGGTAG ACGGTCAGGG TAACTTCGGT TCTATCGACG GCGACTCTGC 350 
GGCGGCAATG CGTTATACGG AAATCCGTCT GGCGAAAATT GCCCATGAAC 4 00 

TGATGGCCGA TCTCGAAAAA GAGACGGTCG ATTTCGTTGA TAACTATGAC 450 
5 GGCACGGAAA AAATTCCGGA CGTCATGCCA ACCAAAATTC CTAACCTGCT 500 
GGTGAACGGT TCTTCCGGTA TCGCCGTAGG TATGGCAACC AACATCCCGC 550 
CGCACAACCT GACGGAAGTC ATCAACGGTT GTCTGGCGTA TATTGATGAT 600 
GAAGACATCA GCATTGAAGG GCTGATGGAA CACATCCCGG AGCCGGACTT 650 
CCCGACGGCG G C AAT CAT T A ACGGTCGTCG CGGTATTGAA GAAGCTTACC 700 

10 GTACCGGTCG CGGCAAGGTG TATATCCGCG CTCGCGCAGA AGTGGAAGTT 750 
GACGCCAAAA CCGGTCGTGA AACCATTATC GTCCACGAAA TTCCGTATCA 800 
GGTAAACAAA GCGCGCCTGA TCGAGAAGAT TGCGGAACTG GTAAAAGAAA 850 
AACGCGTGGA AG GC AT C AG C GCGCTGCGTG ACGAGTCTGA CAAAGACGGT 900 
ATGCGCATCG TGATTGAAGT GAAACGCGAT GCGGTCGGTG AAGTTGTGCT 950 

15 CAACAACCTC TACTCCCAGA CCCAGTTGCA GGTTTCTTTC GGTATCAACA 1000 

TGGTGGCATT GCACCATGGT CAGCCGAAGA TCATGAACCT GAAAGACATC 1050 

ATCGCGGCGT TTGTTCGTCA CCGCCGTGAA GTGGTGACCC GTCGTACTAT 1100 

TTTCGAACTG CGTAAAGCTC GCGATCGTGC TCATATCCTT GAAGCATTAG 1150 

CCGTGGCGCT GGCGAACATC GACCCGATCA TCGAACTGAT CCGTCATGCG 1200 

20 CCGACGCCTG CAGAAGCGAA AACTGCGCTG GTTGCTAATC CGTGGCAGCT 1250 

GGGCAACGTT GCCGCGATGC TCGAACGTGC TGGCGACGAT GCTGCGCGTC 1300 

CGGAATGGCT GGAGCCAGAG TTCGGCGTGC GTGATGGTCT GTACTACCTG 1350 

ACCGAACAGC AAGCTCAGGC GATTCTGGAT CTGCGTTTGC AGAAACTGAC 1400 

CGGTCTTGAG CACGAAAAAC TGCTCGACGA ATACAAAGAG CTGCTGGATC 1450 

2 5 AGATCGCGGA ACTGTTGCGT ATTCTTGGTA GCGCCGATCG TCTGATGGAA 1500 

GTGATCCGTG AAGAGCTGGA GCTGGTTCGT GAACAGTTCG GTGACAAACG 1550 

TCGTACTGAA ATCACCGCCA ACAGCGCAGA CATCAACCTG GAAGATCTGA 1600 

TCACCCAGGA AGATGTGGTC GTGACGCTCT CTCACCAGGG CTACGTTAAG 1650 

TATCAGCCGC TTTCTGAATA CGAAGCGCAG CGTCGTGGCG GGAAAGGTAA 1700 

30 ATCTGCCGCA CGTATTAAAG AAGAAGACTT TATCGACCGA CTGCTGGTGG 1750 

CGAACACTCA C G AC CAT AT T CTGTGCTTCT CCAGCCGTGG TCGCGTCTAT 1800 

TCGATGAAAG TTTATCAGTT GCCGGAAGCC ACTCGTGGCG CGCGCGGTCG 1850 

TCCGATCGTC AACCTGCTGC CGCTGGAGCA GGACGAACGT ATCACTGCGA 1900 

TCCTGCCAGT GACCGAGTTT GAAGAAGGCG TGAAAGTCTT CATGGCGACC 1950 

35 GCTAACGGTA CCGTGAAGAA AACTGTCCTC ACCGAGTTCA ACCGTCTGCG 2000 

TACCGCCGGT AAAGTGGCGA TCAAACTGGT TGACGGCGAT GAGCTGATCG 2050 

GCGTTGACCT GACCAGCGGC GAAGACGAAG TAATGCTGTT CTCCGCTGAA 2100 

GGTAAAGTGG TGCGCTTTAA AGAGTCTTCT GTCCGTGCGA TGGGCTGCAA 2150 

CACCACCGGT GTTCGCGGTA TTCGCTTAGG TGAAGGCGAT AAAGTCGTCT 2200 

40 CTCTGATCGT GCCTCGTGGC GATGGCGCAA TCCTCACCGC AACGCAAAAC 2250 

GGTTACGGTA AACGTACCGC AGTGGCGGAA TACCCAACCA AGTCGCGTGC 2300 

GACGAAAGGG GTTATCTCCA TCAAGGTTAC CGAACGTAAC GGTTTAGTTG 2350 

TTGGCGCGGT ACAGGTAGAT GACTGCGACC AGATCATGAT GATCACCGAT 2400 

GCCGGTACGC TGGTACGTAC TCGCGTTTCG GAAATCAGCA TCGTGGGCCG 2450 

45 TAACACCCAG GGCGTGATCC TCATCCGTAC TGCGGAAGAT GAAAACGTAG 2500 

TGGGTCTGCA ACGTGTTGCT GAACCGGTTG ACGAGGAAGA TCTGGATACC 2550 

ATCGACGGCA GTGCCGCGGA AGGGGACGAT GAAATCGCTC CGGAAGTGGA 2 600 

CGTTGACGAC GAGCCAGAAG AAGAATAA 2 628 

50 

2) INFORMATION FOR SEQ ID NO: 1301 
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30 



4 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301 
GTIMGIAWIM GICCIGSIAT GTA 23 



15 2 ) INFORMATION FOR SEQ ID NO: 1302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302 

TAIADIGGIG GIKKIGCIAT RTA 23 



2) INFORMATION FOR SEQ ID NO: 1303 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303 
GGIGAIGAID Y IMG I GARGG 20 

2) INFORMATION FOR SEQ ID NO: 1304 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1304 

5 

CIARYTTIKY ITTIGTYTG 19 

10 2) INFORMATION FOR SEQ ID NO: 1305 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1305 

ATGGTGACTG CATTGTCAGA TG 22 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1306 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306 
GTCTACGGTT TTCTACAACG TC 22 

2) INFORMATION FOR SEQ ID NO: 1307 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
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(C) ACCESSION NUMBER: M86227 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307 

5 ATGGTGACTG CATTGTCAGA TGTAAACAAC ACGGATAATT ATGGTGCTGG 50 

GCAAATACAA GTATTAGAAG GTTTAGAAGC AGTACGTAAA AGACCAGGTA 100 

TGTATATAGG ATCGACTCAG AGAGAGTTGC ACATTAGTGT GGAAATTGTC 150 

GATAATAGTA TCGATGAAGC ATTAGCTGGT TATGCAAATA AAATTGAAGT 200 

TGTTATTGAA AAAGATAACT GGATTAAAGT AACGGATAAC GGACGTGGTA 250 

10 TCCCAGTTGA TATTCAAGAA AAAATGGGAC GTCCAGCTGT CGAAGTTATT 300 

TTAACTGTTT TACATGCTGG TGGTAAATTC GGCGGTGGCG GATACAAAGT 350 

ATCTGGTGGT TTACATGGTG TTGGTTCATC AGTTGTAAAC GCATTGTCAC 4 00 

AAGACTTAGA AGTATATGTA CACAGAAATG AGACTATATA TCATCAAGCA 450 

TATAAAAAAG GTGTACCTCA ATTTGACTTA AAAGAAGTTG GCACAACTGA 500 

15 TAAGACAGGT ACTGTCATTC GTTTTAAAGC AG AT GG AGAA ATCTTCACAG 550 

AGACAACTGT ATACAACTAT GAAACATTAC AGCAGCGTAT TAGAGAGCTT 600 

GCTTTCTTAA ACAAAGGAAT TCAAATCACA TTAAGAGATG AACGTGATGA 650 

AGAAAACGTT AGAGAAGACT CCTATCACTA TGAGGGCGGT ATTAAATCGT 700 

ACGTTGAGTT ATTGAACGAA AATAAAGAAC CTATTCATGA TGAGCCAATT 750 

2 0 TATATTCATC AATCTAAAGA TGATATTGAA GTAGAAATTG CGATTCAATA 800 

TAACTCAGGA TATGCCACAA ATCTTTTAAC TTAC GCAAAT AACATTCATA 850 

CGTACGAAGG TGGTACGCAT GAAGAC GG AT TCAAACGTGC ATTAACGCGT 900 

GTCTTAAATA GTTATGGTTT AAGTAGCAGA TATGAAGAAG AAAAGATAGC 950 

TTCTGGTGAA GATACACGAG AAGGTATGAC AGCAATTATA TCTATCAAAC 1000 

25 ATGGTGATCC TCAATTCGAA GGTCAAACGA AGACAAAATT AGGTAATTCT 1050 

GAAGTGCGTC AAGTTGTAGA TAAATTATTC TCAGAGCACT TTGAACGATT 1100 

TTTATATGAA AATCCACAAG TCGCACGTAC AGTGGTTGAA AAAGGTATTA 1150 

TGGCGGCACG TGCACGTGTT GCTGCGAAAA AAGCGCGTGA AGTAACACGT 1200 

CGTAAATCAG CGTTAGATGT AGCAAGTCTT CCAGGTAAAT TAGCCGATTG 1250 

30 CTCTAGTCAA AGTCCTGAAG AATGTGAGAT TTTCTTAGTC GAAGGGGACT 1300 

CTGCCGGAGG GTCTACAAAA TCTGGTCGTG ACTCTAGAAC GCAGGCGATT 1350 

TTACCATTAC GAGGTAAGAT ATTAAATGTT GAAAAAGCAC GATTAGATAG 1400 

AATTTTGAAT AAC AAT G AAA TTCGTCAAAT GATCACAGCA TTTGGTACAG 14 50 

GAATCGGTGG CGACTTTGAT CTAGCGAAAG CAAG AT AT C A CAAAATCGTC 1500 

35 ATTATGACTG ATGCCGATGT GGATGGAGCG CATATTAGAA CATTGTTATT 1550 

AATATTCTTC TATCGATTTA TGAGACCGTT AATTGAAGCA GGCTATGTGT 1600 

ATATTGCACA GCCACCGTTG TATAAACTGA CACAAGGTAA ACAAAAGTAT 1650 

TATGTATACA AT G AT AGGG A ACTTGATAAA CTTAAATCTG AATTGAATCC 1700 

AACACCAAAA TGGTCTATTG CGCTATACAA AGGTCTTGGA GAAATGAATG 1750 

40 CAGATCAATT AT GGGAAACA ACAATGAACC CTGAGCACCG CGCTCTTTTA 1800 

CAAGTAAAAC TTGAAGATGC GATTGAAGCG GACCAAACAT TTGAAATGTT 1850 

AATGGGTGAC GTTGTAGAAA ACCGTAGACA ATTTATAGAA GATAATGCAG 1900 

TTTATGCAAA CTTAGACTTC TAA 1923 



45 



2) INFORMATION FOR SEQ ID NO: 1308 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



709 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308 
ATGTAYGTIA TIATGGAYMG IGC 



2) INFORMATION FOR SEQ ID NO: 1309 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309 
ATIATYTTRT TICCYTTICC YTT 



2) INFORMATION FOR SEQ ID NO: 1310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310 
ATIATITSIA TIACYTCRTC 



2) INFORMATION FOR SEQ ID NO: 1311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311 
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5 2) INFORMATION FOR SEQ ID NO: 1312 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312 

AARTAYATIA TICARGARMG IGC 2 3 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1313 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313 
AMIAYICKRT GIGGITTITT YTT 23 

2) INFORMATION FOR SEQ ID NO: 1314 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314 
TAIGAITTYA CIGAISMICA RGC 23 

50 



(i) 

40 



2) INFORMATION FOR SEQ ID NO: 1315 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315 
ACIATIGCIT CIGCYTGIKS YTC 



2) INFORMATION FOR SEQ ID NO: 1316 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316 
GTGAGTGAAA TAATTCAAGA TT 



2) INFORMATION FOR SEQ ID NO: 1317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317 
CACCAAAATC ATCTGTATCT AC 



2) INFORMATION FOR SEQ ID NO: 1318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318 

ACCTAYTCSA TGTACGTRAT CATGGA 26 

10 

2) INFORMATION FOR SEQ ID NO: 1319 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319 
AGRTCGTCIA CCATCGGYAG YTT 23 

25 

2) INFORMATION FOR SEQ ID NO: 1320 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coll 

(B) STRAIN: K-12 MG1655 

40 (C) ACCESSION NUMBER: AE000384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320 

ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC 50 

45 CTACTTAAAC TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA 100 

TTGGTGATGG TCTGAAACCT GTTCAGCGCC GCATTGTGTA TGCGATGTCT 150 

GAACTGGGCC TGAATGCCAG CGCCAAATTT AAAAAATCGG CCCGTACCGT 200 

CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC GCCTGTTATG 250 

AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT 300 

50 GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC 350 

AATGCGTTAC ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA 400 

GCGAGCTGGG GCAGGGGACG GCTGACTGGG TGCCAAACTT CGACGGCACT 4 50 

TTGCAGGAGC CGAAAATGCT ACCTGCCCGT CTGCCAAACA TTTTGCTTAA 500 
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CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT CCACCGCATA 550 

ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA 600 

ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC 650 

TGAAGCGGAA ATTATCACTT CGCGCGCCGA GATCCGTAAA ATCTACGAGA 7 00 

5 ACGGACGTGG TTCAGTGCGT ATGCGCGCGG TGTGGAAGAA AGAAGATGGC 7 50 

GCGGTGGTTA TCAGCGCATT GCCGCATCAG GTTTCAGGTG CGCGCGTACT 8 00 

GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG ATGGTTGACG 850 

ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGCCT GGTGATTGTG 900 

CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC 950 

10 TACCACCGAT CTGGAAAAGA GCTATCGTAT TAACCTTAAT ATGATCGGTC 1000 

TGGATGGTCG TCCGGCGGTG AAAAACCTGC TGGAAATCCT CTCCGAATGG 1050 

CTGGTGTTCC GCCGCGATAC CGTGCGCCGC CGACTGAACT ATCGTCTGGA 1100 

GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG GTGGCGTTTC 1150 

TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA 1200 

15 CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT 1250 

CCTCGAACTG AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC 1300 

GCGGTGAGCA GAGTGAACTG GAAAAAGAGC GCGACCAGTT GCAGGGCATT 1350 

TTGGCTTCCG AGCGTAAAAT GAATAACCTG CTGAAGAAAG AACTGCAGGC 14 00 

AGACGCGCAA GCCTACGGTG ACGATCGTCG TTCGCCGTTG CAGGAACGCG 14 50 

20 AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC 1500 

ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA 1550 

TATCGACGCG CCGGGCCTGA ATTATAAAGC GGGTGATAGC TTCAAAGCGG 1600 

CGGTGAAAGG TAAGAGCAAC CAACCGGTAG TGTTTGTTGA TTCCACCGGT 1650 

CGTAGCTATG CCATTGACCC GATTACGCTG CCGTCGGCGC GTGGTCAGGG 1700 

25 CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGGGCG ACCGTTGACC 1750 

ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG 1800 

GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC 1850 

AGGTAAGGCT TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG 1900 

TGGTGATTGA AGATGCTTCC GATATGCTGC TGGCAATCAC TCAGGCAGGC 1950 

30 CGTATGTTGA TGTTCCCGGT AAGTGATCTG CCGCAGCTGT CGAAGGGCAA 2000 

AGGCAACAAG ATTATCAACA TTCCATCGGC AGAAGCCGCG CGTGGAGAAG 2050 

ATGGTCTGGC GCAATTGTAC GTTCTGCCGC CGCAAAGCAC GCTGACCATT 2100 

CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAAGAGT TACAGAAAGT 2150 

CACTGGCGAA CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA 2200 

35 TCGATCGTGT TGAGATCGAC TCTCCTCGCC GTGCCAGCAG CGGTGATAGC 2250 

GAAGAGTAA 2259 



40 2) INFORMATION FOR SEQ ID NO: 1321 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2403 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: KMP9 

(C) ACCESSION NUMBER: D67074 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321 

GTGAGTGAAA TAATTGAAGA TTTATCACTT GAAGATGTTT TAGGTGATCG 50 

5 CTTTGGAAGA TATAGTAAAT ATATTATTCA AGAGCGTGCA TTGCCAGATG 100 

TTCGTGATGG TTTAAAACCA GTACAACGTC GTATTTTATA TGCAATGTAT 150 

TCAAGTGGTA AT ACAC AC GA TAAAAATTTC CGTAAAAGTG CGAAAACAGT 200 

CGGTGATGTT ATTGGTCAAT ATCATCCACA TGGAGACTTC TCAGTGTACA 250 

AAGCAATGGT CCGTTTAAGT CAAGACTGGA AGTTACGACA TGTCTTAATA 300 

10 GAAATGCATG GTAATAATGG TAGTATCGAT AATGATCCGC CAGCGGCAAT 350 

GCGTTACACT GAAGCTAAGT TAAGCTTACT AGCTGAAGAG TTATTACGTG 400 

ATATTAATAA AGAGACAGTT TCTTTCATTC CAAACTATGA TGATACGACA 450 

CTCGAACCAA TGGTATTGCC ATCAAGATTT CCTAACTTAC TAGTGAATGG 500 

TTCTACAGGT ATATCTGCAG GTTACGCGAC AGATATACCA CCACATAATT 55Q 

15 TAGCTGAAGT GATTCAAGCA ACACTTAAAT AT ATT GAT AA TCCGGATATT 600 

ACAGTCAATC AATTAATGAA ATAT AT T AAA GGTCCTGATT TTCCAACTGG 650 

TGGTATTATT CAAGGTATTG ATGGTATTAA AAAAGCTTAT GAATCAGGTA 700 

AAGGTAGAAT TATAGTTCGT TCTAAAGTTG AAGAAGAAAC TTTACGCAAT 750 

GGACGTAAAC AGTTAATTAT TACTGAAATT CCATATGAAG TGAACAAAAG 800 

2 0 TAGCTTAGTA AAACGTATCG ATGAATTACG TGCTGACAAA AAAGTCGATG 850 

GTATCGTTGA AGTACGTGAT GAAACTGATA GAACTGGTTT ACGAATAGCA 900 

ATTGAATTGA AAAAAGATGT GAACAGTGAA TCAATCAAAA ATTATCTTTA 950 

TAAAAACTCT GATTTACAGA TTTCATATAA TTTCAACATG GTCGCTATTA 1000 

GTGATGGTCG TCCAAAATTG ATGGGTATTC GTCAAATTAT AGATAGTTAT 1050 

25 TTGAATCATC AAATTGAGGT TGTTGCAAAT AGAACGAAGT TTGAATTAGA 1100 

TAATGCTGAA AAACGTATGC ATATCGTTGA AGGTTTGATT AAAGCGTTGT 1150 

CAATTTTAGA TAAAGTAATT GAATTGATTC GTAGCTCTAA AAACAAGCGT 1200 

GACGCTAAAG AAAACCTTAT CGAAGTATTC GAGTTCACAG AAGAACAGGC 1250 

TGAAGCAATT GTAATGTTAC AGTTATATCG TTTAACAAAC ACTGACATAG 1300 

3 0 TTGCGCTTGA AGGTGAACAT AAAGAACTTG AAGC AT TAAT CAAACAATTA 1350 

CGTCATATTC TTGATAACCA TGATGCATTA TTGAATGTCA TAAAAGAAGA 14 00 

ATTGAATGAA ATTAAAAAGA AATTCAAATC TGAACGACTG TCTTTAATTG 1450 

AAGCAGAAAT TGAAGAAATT AAAATTGACA AAGAAGTTAT GGTGCCTAGT 1500 

GAAGAAGTTA TTTTAAGTAT GACACGTCAT GGATATATTA AACGTACTTC 1550 

35 TATTCGTAGC TTTAATGCTA GCGGTGTTGA GGATATTGGT TTAAAAGATG 1600 

GTGACAGTTT ACTTAAACAT CAAGAAGTAA ATACGCAAGA TACCGTACTA 1650 

GTATTTACAA ATAAAGGTCG TTATCTATTT ATACCGGTTC ATAAATTAGC 1700 

AGATATTCGT T GGAAAG AAT TGGGGCAACA TGTATCACAA ATAGTTCCTA 1750 

TCGAAGAAGA TGAAGTGGTT ATTAATGTCT TTAATGAAAA GGACTTTAAT 1800 

40 ACAGATGCAT TTTATGTTTT TGCGACTCAA AATGGCATGA TTAAGAAAAG 1850 

TACAGTGCCT CTATTTAAAA CAACGCGTTT TAATAAACCT TTAATTGCTA 1900 

CTAAAGTTAA AGAAAATGAT GATTTGATTA GTGTTATGCG CTTTGAAAAA 1950 

GATCAATTAA TTACCGTCAT T AC TAAT AAA GGTATGTCAT TAACGTATAA 2000 

TACAAGTGAA CTATCAGATA CCGGATTAAG GGCAGCTGGT GTTAAATCAA 2050 

45 TAAATCTTAA AGCTGAAGAT TTCGTTGTTA TGACAGAAGG TGTTTCTGAA 2100 

AATGATACTA TATTGATGGC CACACAACGC GGCTCGTTAA AACGTATTAG 2150 

TTTTAAAATC TTACAAGTTG CTAAAAGAGC ACAACGTGGA ATAACTTTAT 2200 

T AAAAGAAT T AAAGAAAAAT CCACATCGTA TTGTAGCTGC ACATGTAGTG 2250 

ACAGGTGAAC ATAGTCAATA TACATTATAT TCAAAATCAA ATGAAGAACA 2300 

50 TGGTTTAATT AATGATATTC ATAAATCTGA ACAATATACA AATGGCTCAT 2350 

TCATTGTAGA TACAGATGAT TTTGGTGAAG TAATAGACAT GTATATTAGC 2400 

TAA 2403 



715 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01 150 



2) INFORMATION FOR SEQ ID NO: 1322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322 
RTIGAIAAYI SIGTIGAYGA RG 



2) INFORMATION FOR SEQ ID NO: 1323 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323 
ACIAWRSAIG GIGGIACICA YG 



2) INFORMATION FOR SEQ ID NO: 1324 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324 
CCICCIGCIS WRTCICCYTC " ' 



2) INFORMATION FOR SEQ ID NO: 1325 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325 
RTTCATYTCI CCIARICCYT T 



2) INFORMATION FOR SEQ ID NO: 1326 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326 
TGATTCAATA CAGGTTTTAG AG 



2) INFORMATION FOR SEQ ID NO: 1327 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327 
CTAGATTTCC TCCTCATCAA AT 



2) INFORMATION FOR SEQ ID NO: 1328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1992 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN4220 

(C) ACCESSION NUMBER: D67075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328 



ATGAATAAAC 
GTTAGAAGCA 
10 AACGGGGATT 
GAAGTATTGA 
TGGTAGTATT 
ATAAATCAGG 
GGAGGTAAAT 
15 TGTTGGTGCT 
TCCATCGAGA 
CCATCTTCTG 
AGTAACATTT 
ATTTTGATGT 
2 0 AATTTAAAAA 
GCATTACCAT 
AAGGAAAAGA 
GGTATAGAGG 
TATTTTAAGT 
25 AAGTTGGTTT 
CGTATTAATG 
TCGTGAAGGT 
TGCAATTTGA 
AGTGCTGTTG 
30 AAAAGGACAA 
AAGCAAGGGA 
AAAAACAAGC 
AAGTAAAAAC 
CGGGAGGTTC 
35. CCATTACGTG 
TTTTAAAAAT 
TTGGTACTGA 
ATGACTGATG 
ATTCTTCTTC 
4 0 TTGCTTTACC 
CGAGTTGAAT 
ACTTGGTAAA 
ACCCTGAGCA 
ATTCGTGTAC 
45 ATTAATGGGT 
TTGAGTTTGG 
CAAGTGCTTG 



AAAATAATTA 

GTTCGTAAAA 

ACATCATCTA 

ATGGTTACGG 

TCTATAGAAG 

TAAACCGACA 

TTGGACAAGG 

TCAGTTGTAA 

TGGTAATATA 

GTTTAGTGAA 

AAACCTGATG 

TTTAAGTGAA 

TAACGCTTAA 

TATGAAGAAG 

AGTTTTGCAT 

TAGACGTAGC 

TTTGTAAATA 

TAAAACAGCA 

AACTTAAAAC 

TTAACAGCTG 

AGGACAAACG 

ATTCAGTTGT 

TTGTCTAAAT 

AGCTGCACGT 

GTAAAGACAC 

ACTGAAAAAA 

AGCAAAACTT 

GTAAGGTAAT 

GAAGAAATTA 

CTTTAAAATT 

CTGATACTGA 

AAAT AT AT G A 

TCCACTTTAT 

ACGCTTGGAC 

GGCTTCACGT 

ATTATGGGAA 

AAGTTGAAGA 

GACAAAGTAC 

TATGCAAGAG 

AAAATGATCA 



TTCAGATGAT 

GACCTGGTAT 

GTATATGAAA 

T AAC G AAAT A 

ATAATGGACG 

GTCGAAGTTA 

CGGCTATAAA 

ATGCATTGAG 

TAT CATC AAA 

AAAAGGTAAA 

ACACAATTTT 

CGACTACAAG 

TGATTTACGC 

GAAT C AAAGA 

GACGTGGCTA 

TTTCCAATAT 

ATGTACGTAC 

ATGACACGTG 

AAAAGATAAA 

TTGTGTCTGT 

AAAT C T AAAT 

TGCAGACAAA 

CACTTGTGAA 

AAAGCTCGTG 

TTTGCTATCT 

ATGAATTGTA 

GGACGAGACC 

TAATACAGAG 

ATACAATTAT 

GAAGATAGTA 

TGGTGCGCAT 

AACCGCTTGT 

AAATTGGAAA 

AGACGAAGAG 

TACAACGTTA 

ACGACGATGA 

TGAAGTGCGT 

AACCTAGACG 

GACCAAAGTA 

ATTTGATGAG 



TCAATACAGG 
GTATATTGGA 
• TTGTCGATAA 
GATGTAACAA 
TGGTATGCCA 
TCTTTACTGT 
ACTTCAGGTG 
TGAATGGCTT 
GTTTTAAAAA 
ACTAAGAAAA 
TAAAGCATCT 
AGTCTGCGTT 
AGTGGTAAAG 
GTTTGTTAGT 
CATTTTCAGG 
AATGATCAAT 
TAAAGATGGT 
TATTTAATGA 
AACTTAGATG 
TCGTATTCCA 
TGGGTACTTC 
TTGCCATTCT 
AAAAGCGATT 
AAGATGCTCG 
GGTAAATTAA 
TTTAGTCGAA 
GCAAATTCCA 
AAAGCACGTC 
CCACACAATC 
ATTATAATCG 
ATTCAAGTGC 
TCAAGCAGGT 
AAGGTAAAGG 
CTTAATAAAT 
CAAAGGTTTG 
ACCCAGAAAC 
TCATCTAAAC 
TGAATGGATT 
TTTTAGATAA 
GAGGAAATCT 



TTTTAGAGGG 50 

TCAACTGATA 100 

CTCCGTCGAT 150 

TTAAT AAAGA 200 

ACAGGTATAC 250 

TTTACATGCA 300 

GTCTTCACGG 350 

GAAGTTGAAA 400 

CGGTGGTTCG 4 50 

CAGGTACCAA 500 

ACATCATTTA 550 

CTTATTGAAA 600 

AGCGTCAAGA 650 

TATGTCAATG 700 

TGAAGCAAAT 750 

ATTCAGAAAG 800 

GGTACACATG 850 

TTATGCACGT 900 

GTAATGATAT 950 

GAAGAATTAT 1000 

TGAAGCTAGA 1050 

ATTTAGAAGA 1100 

AAAGCACAAC 1150 

TTCAGGTAAG 1200 

CACCTGCACA 1250 

GGTGATTCTG 1300 

AGCGATATTA 1350 

TAGAAGATAT 14 00 

GGGGCAGGCG 14 50 

TGTAATTATT 1500 

TATTGTTAAC 1550 

CGTGTATTTA 1600 

CAAAAC AAAG 1650 

TGCAAAAAGA 17 00 

GGTGAAATGA 1750 

ACGAACTTTA 1800 

GTGTAACAAC 1850 

GAAAAGCATG 1900 

TTCTGAAGTA 1950 

AG 1992 



50 



2) INFORMATION FOR SEQ ID NO: 1329 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329 
TGTAGAGCGC GGTATCATCA AAGTA 



2) INFORMATION FOR SEQ ID NO: 1330 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330 
AGATTCGAAC TTGGTGTGCG GG 



2) INFORMATION FOR SEQ ID NO: 1331 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331 
GCCCTTGAGG TACAGAATGG TAATGAAGTT 



2) INFORMATION FOR SEQ ID NO: 1332 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic- acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332 
GACCGCGGCG CAGACCATCA 



2) INFORMATION FOR SEQ ID NO: 1333 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333 
TCATGGTGAC TTATCTATTT ATG 



2) INFORMATION FOR SEQ ID NO: 1334 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334 
CATCTATTTA TAAAGCAATG GTA 



2) INFORMATION FOR SEQ ID NO: 1335 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335 
CTATTTATGG AGCAATGGT 
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2) INFORMATION FOR SEQ ID NO: 1336 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336 
TGGAGACTAC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1337 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337 
TGGAGACTTC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1338 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338 
GTGTACGGAG CAATG 



2) INFORMATION FOR SEQ ID NO: 1339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339 
CCAGCGGAAA TGCGT 15 

2) INFORMATION FOR SEQ ID NO: 1340 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340 
GAACAAGGTA TGACACCGGA TAAAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1341 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341 
40 GATAACTGAA ATCCTGAGCC ATACG 25 



2 ) INFORMATION FOR SEQ ID NO: 1342 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342 
GATGTTATTG GTCAATATCA TCCA 
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24 



2) INFORMATION FOR SEQ ID NO: 1343 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343 
AAGAAACTGT CTCTTTATTA ATATCACGT 29 

20 

2) INFORMATION FOR SEQ ID NO: 1344 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 4 
35 AGCAGCAACG ATGTTACGCA GCAG 24 



40 



2) INFORMATION FOR SEQ ID NO: 1345 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345 

50 

CCCGCCGAGC ATTTCAACTA TTG 23 
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TTCTAGCTGC GGCAGATGAG CGAAATCTGC 
TGGGCGATCG ATGCACGGCT AGGGCGTGTA 
TGATCTGACG TTTCCCGGCG AGAGGCGCGG 
AAATGCTCGG CGGGCGCGTC ATGGAGGAGT 
5 GAGATCGGGG ATGAGTTACT TGACTGCGAA 
AGCGTATGAA ATCGCGGAGG CTCCGCAGGG 
AGGGCGTCAT CGCCGGGCGG CCAGTCCGTT 
ATCTGGGATT ACTTTTACTA TGCCGATGAA 
TACAAAGCAC ATAGAGTCCT ACAGGCTCGC 
10 AAAAGGTTGA GGTCTTGCGT GCCGCTTTCA 



PCT/CAO0/0115O 

CGCTCTGGAT CGGTGGGGGC 300 

ACACGCAAGC ACGATGATAT 350 

CGAGCTCGAG GCAATAGTTG 4 00 

TGGACTATGG ATTCTTAGCG 4 50 

CCTGCTTGGT GGGCAGACGA 500 

CTCGTGCCCA GAGGCGGCTG 550 

GTAACAGCTG GGAGGCGATC 600 

GTACCACCAG TGGACTGGCC 650 

ATGCACCTCA CTCGGGGCGG 700 

GGTCGCGATA TGCGGCCTAA 7 50 



15 



2) INFORMATION FOR SEQ ID NO: 1349 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 ( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349 

25 

CAGCCGACCA ATGAGTATCT TGCC 24 



30 2) INFORMATION FOR SEQ ID NO: 1350 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350 

TAATCAGGGC AGTTGCGACT CCTA 2 4 



45 

2) INFORMATION FOR SEQ ID NO: 1351 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 531 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
5 (B) STRAIN: Stone 130 

(C) ACCESSION NUMBER: L06157 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351 

10 ATGTTATGGA GCAGCAACGA TGTTACGCAG CAGGGCAGTC GCCCTAAAAC 50 

AAAGTTAGGT GGCTCAATGA GCATCATTGC AACCGTCAAG ATCGGCCCTG 100 

ACGAAATTTC AGCCATGAGG GCTGTGCTCG ATCTCTTCGG CAAAGAGTTT 150 

GAGGACATTC CAACCTACTC TGATCGCCAG CCGACCAATG AGTATCTTGC 200 

CAATCTTCTG CACAGCGAGA CGTTCATCGC GCTCGCTGCT TTTGACCGCG 250 

15 GAACAGCAAT AGGTGGGCTC GCCGCCTACG TTCTACCCAA GTTCGAGCAA 300 

GCGCGAAGCG AGATCTACAT TTATGACTTG GCAGTCGCTT CCAGCCATCG 350 

AAGGCTAGGA GTCGCAACTG CCCTGATTAG CCACCTGAAG CGTGTGGCGG 4 00 

TTGAACTTGG CGCGTATGTA ATCTATGTGC AAGC AG AC TA CGGTGACGAT 4 50 

CCGGCAGTCG CTCTCTACAC AAAGCTTGGA GTTCGGGAAG ACGTCATGCA 500 

20 CTTCGACATT GATCCAAGAA CCGCCACCTA A 531 



2) INFORMATION FOR SEQ ID NO: 1352 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352 

35 

CCACGCTGAC AGAGCCGCAC CG 22 



40 2) INFORMATION FOR SEQ ID NO: 1353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353 

GGCCAGCTCC CATCGGACCC TG 22 
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10 



15 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354 
CACGCTGACA GAGCCGCACC G 21 

2) INFORMATION FOR SEQ ID NO: 1355 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355 
ATGCCGTTGC TGTCGAAATC CTCG 24 

30 



2) INFORMATION FOR SEQ ID NO: 1356 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
45 (C) ACCESSION NUMBER: M97172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356 

ATGAACACGA TCGAATCGAT CACGGCGGAC CTGCACGGAC TGGGCGTCCG 50 

50 GCCCGGCGAC CTGATCATGG TCCATGCATC GCTGAAAGCC GTCGGCCCGG 100 

TCGAGGGAGG TGCGGCCTCG GTGGTGTCGG CCCTTCGCGC CGCGGTCGGG 150 

TCCGCAGGGA CCCTGATGGG TTATGCCTCA TGGGACCGCT CGCCCTATGA 200 

GGAGACGCTG AACGGCGCGC GGATGGACGA AGAACTGCGC CGCCGGTGGC 250 
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CACCCTTCGA TCTGGCCACA TCCGGTACCT ATCCCGGCTT CGGCCTGCTC 300 

AACCGGTTTC TGCTTGAGGC GCCCGACGCA CGGCGCAGCG CGCATCCCGA 350 

CGCCTCCATG GTCGCGGTCG GCCCCCTTGC CGCCACGCTG ACAGAGCCGC 400 

ACCGGCTTGG GCAGGCGCTG GGCGAAGGCT CGCCGCTGGA GCGCTTCGTC 4 50 

5 GGGCATGGCG GAAAGGTCCT GCTTCTGGGA GCGCCGCTCG ACTCCGTCAC 500 

CGTGCTGCAT TACGCCGAGG CCATCGCCCC CATCCCGAAC AAACGCCGCG 550 

TGACCTATGA AATGCCGATG CTCGGCCCGG ATGGCAGGGT CCGATGGGAG 600 

CTGGCCGAGG ATTTCGACAG CAACGGCATT CTCGATTGCT TCGCGGTCGA 650 

TGGGAAGCCG GATGCCGTCG AGACGATCGC CAAGGCTTAT GTCGAACTGG 700 

10 GCCGGCATCG GGAAGGCATC GTCGGTCGCG CACCCTCCTA TCTGTTTGAA 750 

GCGCAGGATA TCGTCTCGTT CGGCGTCACC TATCTCGAAC AGCATTTCGG 8 00 

CGCGCCCTGA 810 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1357 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357 
GCCCATCCAT TTGCCTTTGC 20 

2) INFORMATION FOR SEQ ID NO: 1358 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358 
GCGTACCAAC TTGCCATCCT GAAG 24 

45 



2) INFORMATION FOR SEQ ID NO: 1359 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359 

TGCCCCTGCC ACCTCACTC !9 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1360 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360 
CGTACCAACT TGCCATCCTG AAGA 24 

2) INFORMATION FOR SEQ ID NO: 1361 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X01385 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361 



GTGCAATACG AATGGCGAAA AGCCGAGCTC 
TGGGGTTACC CCCGGCGGTG TGCTGCTGGT 

45 TCCGGCCCCT CGAAGATGGG CCACTTGGAC 
GCGCTGGGTC CGGGAGGGAC GCTCGTCATG 
CGACGAGCCG TTCGATCCTG CCACGTCGCC 
TTGTCTCTGA CACATTCTGG CGCCTGCCAA 
CCATTTGCCT TTGCGGCAGC GGGGCCACAG 

50 TCCATTGCCC CTGCCACCTC ACTCGCCTGC 
ATGAACTCGA TGGGCAGGTA CTTCTCCTCG 
ACGACGCTGC ATCTTGCCGA GTTGATGGCA 
GAGACACTGC ACCATTCTTC AGGATGGCAA 

72 



ATCGGTCAGC TTCTCAACCT 50 

CCACAGCTCC TTCCGTAGCG 100 

TGATCGAGGC CCTGCGTGCT 150 

CCCTCGTGGT CAGGTCTGGA 200 

CGTTACACCG GACCTTGGAG 250 

AT GT AAAGCG CAGCGCCCAT 300 

GCAGAGCAGA TCATCTCTGA 350 

AAGCCCGGTC GCCCGTGTCC 400 

GCGTGGGACA CGATGCCAAC 4 50 

AAGGTTCCCT ATGGGGTGCC 500 

GTTGGTACGC GTCGATTATC 550 
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TCGAGAATGA CCACTGCTGT GAGCGCTTTG CCTTGGCGGA CAGGTGGCTC 600 

AAGGAGAAGA GCCTTCAGAA GGAAGGTCCA GTCGGTCATG CCTTTGCTCG 650 

GTTGATCCGC TCCCGCGACA TTGTGGCGAC AGCCCTGGGT CAACTGGGCC 700 

GAGATCCGTT GATCTTCCTG CATCCGCCAG AGGGCGGGAT GCGAAGAATG 750 

CGATGCCGCT CGCCAGTCGA TTGGCTGAGC TCATGA 786 



2) INFORMATION FOR SEQ ID NO: 1362 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362 

20 

CGCCGCCATC GCCCAAAGCT GG .22 



25 2) INFORMATION FOR SEQ ID NO: 1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1363 

CGGCATAATG GAGCGCGGTG ACTG 24 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 1364 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364 
TTTCTCGCCC ACGCAGGAAA AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1365 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365 
CATCCTCGAC GAATATGCCG CG 22 

15 

2) INFORMATION FOR SEQ ID NO: 1366 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 
(C) ACCESSION NUMBER: M88012 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366 



ATGACTGATC CCCGCAAAAA CGGCGATTTG CACGAACCCG CGACGGCACC 50 

CGCGACGCCC TGGTCCAAAA GCGAGCTGGT CCGGCAATTG CGCGACCTCG 100 

35 GCGTGCGCTC AGGCGATATG GTGATGCCGC ATGTGTCGTT GCGCGCCGTC 150 

GGGCCGCTGG CGGACGGACC GCAGACACTT GTCGATGCGC TGATCGAGGC 200 

CGTCGGCCCC ACCGGGAATA TTCTCGCCTT CGTCTCGTGG CGCGATTCGC 250 

CCTATGAACA GACGCTGGGT CATGATGCGC CGCCCGCCGC CATCGCCCAA 300 

AGCTGGCCTG CGTTCGACCC CGACCATGCG CCCGCCTACC CCGGCTTTGG 350 

40 CGCGATCAAC GAATTTATCC GAACCTATCC GGGGTGTCGG CGCACGGCCC 400 

ATCCCGACGC ATCGATGGCG GCGATCGGGC CCGATGCGGC GTGGCTGGTG 450 

GCGCCGCACG AGATGGGCGC CGCTTATGGC CCCCGCTCGC CGATCGCGCG 500 

TTTTCTCGCC CACGCAGGAA AAATCCTGTC GATCGGCGCC GGGCCCGATG 550 

CAGTCACCGC GCTCCATTAT GCCGAAGCGG TGGCGCGGAT CGAGGGCAAG 600 

45 CGCCGCGTCA CTTATTCGAT GCCCTTACTG CGCGAAGGCA AGCGCGTCTG 650 

GGTCACCACG TCCGACTGGG ATTCGAACGG CATCCTCGAC GAATATGCCG 700 

CGCCCGACGG CCCCGACGCG GTC GAACGGA TCGCCCGCGA CTATCTCGCC 750 

CGCACCAGGG TTGCGCAAGG CCCGGTCGGC GGCGCGCAAT CCCGGCTGAT 800 

CGACGCGGCC GATATCGTTT CCTTCGGCAT CGAATGGCTC GAGGCGCGCC 850 

50 ACGCCGCGCC AGCGGCGGCA GCGCTGAAGC CGAAACAACG CCGCGACTGA 900 
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2) INFORMATION FOR SEQ ID NO: 1367 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367 
CAAATATACT AACAGAAGCG TTCA 



2) INFORMATION FOR SEQ ID NO: 1368 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 68 
AGGATCTTGC CAATACCTTT AT 



2) INFORMATION FOR SEQ ID NO: 1369 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369 
AAACCTTTGT TTCGGTCTGC TAAT 



2) INFORMATION FOR SEQ ID NO: 1370 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370 
AAGCGATTCC AATAATACCT TGCT 24 

2) INFORMATION FOR SEQ ID NO: 1371 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 558 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cltrobacter diver sus 
(C) ACCESSION NUMBER: Ml 8 9 67 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371 



ATGAATTATC AAATTGTGAA TATTGCGGAA TGCAGCAATT ATCAGTTAGA 50 

AGCAGCAAAT ATACTAACAG AAGCGTTCAA TGATCTTGGT AACAATTCAT 100 

3 0 GGCCAGATAT GACGAGTGCA ACAAAAGAAG TAAAAGAATG TATTGAGAGT 150 

CCAAACCTTT GTTTCGGTCT GCTAATAAAT AACTCCTTAG TTGGCTGGAT 2 00 

AGGCTTAAGG CCAATGTACA AGGAAACCTG GGAATT GCAT CCATTGGTTG 250 

TCAGACCAGA TTATCAAAAT AAAGGTATTG GCAAGATCCT GCTTAAGGAA 300 

TTAGAAAACA GAGCTAGAGA GCAAGGTATT ATTGGAATCG CTTTAGGAAC 350 

35 AGATGATGAA TACTATAGAA CAAGTCTCTC TTTAATAACT ATAACAGAAG 4 00 

ATAATATATT TGATTCAATA AAAAATATTA AAAATATTAA TAAACATCCA 4 50 

TATGAGTTTT ATCAGAAGAA TGGTTATTAT ATTGTTGGAA TAATTCCAAA 500 

TGCCAATGGT AAAAACAAAC CAGATATTTG GATGTGGAAA AGTTTAATCA 550 
AAGAGTAA 



558 



40 



2) INFORMATION FOR SEQ ID NO: 1372 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372 

733 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCTTTCGTTG CCTTTGCCGA GGTC 24 



2) INFORMATION FOR SEQ ID NO: 1373 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373 
CACCCCTGTT GCTTCGCCCA CTC 23 

2) INFORMATION FOR SEQ ID NO: 1374 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374 
AGATATTGGC TTCGCCGCAC CACA 24 

35 

2) INFORMATION FOR SEQ ID NO: 1375 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1375 
50 CCCTGTTGCT TCGCCCACTC CTG 23 
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2) INFORMATION FOR SEQ ID NO: 1376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M94066 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376 

ATGATCGTCA TCTGCGACCA CGACAACCTC GACGCCTGGC TGGCGCTGCG 50 

CACCGCGCTG TGGCCCTCCG GCTCGCCTGA AGATCACCGC GCGGAAATGC 100 

GCGAGATATT GGCTTCGCCG CACCACACCG CGTTTATGGC GCGGGGGCTG 150 

20 GACGGCGCTT TCGTTGCCTT TGCCGAGGTC GCGCTGCGCT ACGATTACGT 200 

CAACGGCTGC GAATCGTCGC CGGTGGCGTT TTTGGAAGGA ATTTATACCG 250 

CCGAACGCGC CCGCCGCCAG GGCTGGGCCG CGCGCCTGAT CGCGCAGGTG 300 

CAGGAGTGGG CGAAGCAACA GGGGTGCAGC GAGCTGGCGT CGGATACCGA 350 

TATCGCCAAT CTGGACTCCC AGCGCCTGCA TGCGGCGCTG GGCTTTGCCG 400 

25 AAACGGAGCG AGTAGTGTTT TACCGCAAAA CGCTGGGCTG A 441 



30 



2 ) INFORMATION FOR SEQ ID NO: 1377 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1377 

40 

GCCGTGGGTC GATGTTTGAT GTTA 24 



45 2) INFORMATION FOR SEQ ID NO: 1378 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1378 
GCTCGATGAC GCCAACTACC TCTG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1379 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1379 
2 0 AGCAGCAACG ATGTTACGCA GCAG 24 



25 



2) INFORMATION FOR SEQ ID NO: 1380 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1380 

35 

CGCTCGATGA CGCCAACTAC CTCT 24 



40 2) INFORMATION FOR SEQ ID NO: 1381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: X02340 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1381 

GTGGTAACGG CGCAGTGGCG GTTTTCATGG CTTCTTGTTA TGACATGTTT 50 

TTTTGGGGTA CAGTCTATGC CTCGGGCATC CAAGCAGCAA GCGCGTTACG 100 

5 CCGTGGGTCG ATGTTTGATG TTATGGAGCA GCAACGATGT TACGCAGCAG 150 

GGCAGTCGCC CTAAAACAAA GTTAAACATC ATGAGGGAAG CGGTGATCGC 200 

CGAAGTATCG ACTCAACTAT CAGAGGTAGT TGGCGTCATC GAGCGCCATC 250 

TCGAACCGAC GTTGCTGGCC GTACATTTGT ACGGCTCCGC AGTGGATGGC 300 

GGCCTGAAGC CACACAGTGA TATTGATTTG CTGGTTACGG TGACCGTAAG 350 

10 GCTTGATGAA ACAACGCGGC GAGCTTTGAT CAACGACCTT TTGGAAACTT 4 00 

CGGCTTCCCC TGGAGAGAGC GAGATTCTCC GCGCTGTAGA AGTCACCATT 450 

GTTGTGCACG ACGACATCAT TCCGTGGCGT TATCCAGCTA AGCGCGAACT 500 

GCAATTTGGA GAATGGCAGC GCAATGACAT TCTTGCAGGT ATCTTCGAGC 550 

CAGCCACGAT CGACATTGAT CTGGCTATCT TGCTGACAAA AGCAAGAGAA 600 

15 CATAGCGTTG CCTTGGTAGG TCCAGCGGCG GAGGAACTCT TTGATCCGGT 650 

TCCTGAACAG GATCTATTTG AGGCGCTAAA TGAAACCTTA ACGCTATGGA 7 00 

ACTCGCCGCC CGACTGGGCT GGCGATGAGC GAAATGTAGT GCTTACGTTG 750 

TCCCGCATTT GGTACAGCGC AGTAACCGGC AAAATCGCGC CGAAGGATGT 800 

CGCTGCCGAC TGGGCAATGG AGCGCCTGCC GGCCCAGTAT CAGCCCGTCA 850 

20 TACTTGAA.GC TAGACAGGCT TATCTTGGAC AAGAAGAAGA TCGCTTGGCC 900 

TCGCGCGCAG ATCAGTTGGA AGAATTTGTC CACTACGTGA AAGGCGAGAT 950 

CACCAAGGTA GTCGGCAAAT AA 972 



25 

2) INFORMATION FOR SEQ ID NO: 1382 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1382 
TAGATATGAT AGGCGGTAAA AAGC 24 

40 

2) INFORMATION FOR SEQ ID NO: 1383 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1383 
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2) INFORMATION FOR SEQ ID NO: 1384 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1384 
GATATGATAG GCGGTAAAAA GC 



2) INFORMATION FOR SEQ ID NO: 1385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1385 
TCCCAAATTC GAGTAAGAGG TA 



2) INFORMATION FOR SEQ ID NO: 1386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1386 

ATGAAAGAAA GATATGGAAC AGTATATAAA GGCTCTCAGA GGCTCATAGA 50 
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CGAGGAAAGT GGAGAAGTAA TAGAGGTAGA 
CGTCTGGTAA CTTTGTAAAA GCGTATATCG 
GATATGATAG GCGGTAAAAA GCTCAAGATT 
TGTACATCTA AGTAATAACA CAATGATAGC 
5 AAGGAACAAA TACAAGCACG AAAACCGTAA 
GAAGAAGGAA ATATCATTAA AAGAAGAACT 
AGAGCTACTC ATGAGAGGCG ATGACCAAAA 
AATTTGGGAA CTTTGAGCAA GAGGACGACC 
TCAGAATATT ATTCTTTCAA GGAGTAG 
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TAAGCTATAC CGTAAACAAA 100 

TCCAATTAAT AAGTATGTTA 150 

GTTAATTATA TATTAGATAA 200 

AACTGTTAGA GAAATAGCAG 250 

ATACAACGCT TAAAATCTTA 300 

GGAGCATTAA TGCTAAACCC 350 

ACAAAAATAC CTCTTACTCG 400 

AAAAGCAAGA AAATGCTTTA 4 50 

477 



2) INFORMATION FOR SEQ ID NO: 1387 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) ' STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1387 
25 TTATGCCTCT TCCGACCATC AAGC 2 4 



30 



2) INFORMATION FOR SEQ ID NO: 1388 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 8 

40 

TACGCTCGTC ATCAAAATCA CTCG 24 



45 2) INFORMATION FOR SEQ ID NO: 1389 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1389 
GAATAACGGT TTGGTTGATG CGAG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1390 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1390 
20 ATGGCAAGAT CCTGGTATCG GTCT 24 



25 



2) INFORMATION FOR SEQ ID NO: 1391 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: J01839 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1391 

40 ATGAGCCATA TTCAACGGGA AACGTCTTGC TCGAGGCCGC GATTAAATTC 50 

CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC GATAATGTCG 100 

GGCAATCAGG TGCGACAATC TATCGATTGT ATGGGAAGCC CGATGCGCCA 150 

GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA 200 

TGAGATGGTC AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA 250 

45 TCAAGCATTT TATCCGTACT CCTGATGATG CATGGTTACT CACCACTGCG 300 

ATCCCCGGGA AAACAGCATT CCAGGTATTA GAAGAATATC CTGATTCAGG 350 

TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG TTGCATTCGA 400 

TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 450 

CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA 500 

50 TGACGAGCGT AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA 550 

AGCTTTTGCC ATTCTCACCG GATTCAGTCG TCACTCATGG TGATTTCTCA 600 

CTTGATAACC TTATTTTTGA CGAGGGGAAA TTAATAGGTT GTATTGATGT 650 

TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC ATCCTATGGA 700 

740 
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ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 750 
TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT 800 
CGATGAGTTT TTCTAA 816 



2) INFORMATION FOR SEQ ID NO: 1392 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1392 
TGGGTGGAGA GGCTATTCGG CTAT 24 

20 

2) INFORMATION FOR SEQ ID NO: 1393 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1393 
35 CAGTCCCTTC CCGCTTCAGT GAC 23 



40 



2) INFORMATION FOR SEQ ID NO: 1394 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1394 

50 

GACGTTGTCA CTGAAGCGGG AAGG 24 
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2) INFORMATION FOR SEQ ID NO: 1395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1395 
CTTGGTGGTC GAATGGGCAG GTAG 24 

2) INFORMATION FOR SEQ ID NO: 1396 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00618 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1396 



ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 50 

GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG 100 

35 CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 150 

ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 200 

ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG 250 

TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG 300 

GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 350 

40 TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG 4 00 

ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 4 50 

GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC 500 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC 550 

TCGTCGTGAC CCATGGCGAT GCCTGCTTGC C GAATATC AT GGT GGAAAAT. 600 

45 GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 650 

CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG 700 

GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 750 

TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGA 795 



2) INFORMATION FOR SEQ ID NO: 1397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1397 
GTGGGAGAAA ATGAAAACCT AT 



2) INFORMATION FOR SEQ ID NO: 1398 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1398 
ATGGAGTGAA AGAGCCTGAT 



2) INFORMATION FOR SEQ ID NO: 1399 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
.(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1399 
ACCTATGATG TGGAACGGGA AAAG 



2) INFORMATION FOR SEQ ID NO: 1400 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1400 
CGATGGAGTG AAAGAGCCTG ATG . 23 



10 2) INFORMATION FOR SEQ ID NO: 1401 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
<C) ACCESSION NUMBER: V01547 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1401 



ATGGCTAAAA TGAGAATATC ACCGGAATTG AAAAAACTGA TCGAAAAATA 50 

CCGCTGCGTA AAAGATACGG AAGGAATGTC TCCTGCTAAG GTATATAAGC 100 

TGGTGGGAGA AAATGAAAAC CTATATTTAA AAAT GACGGA CAGCCGGTAT 150 

AAAGGGACCA CCTATGATGT GGAACGGGAA AAGGACATGA TGCTATGGCT 200 

30 GGAAGGAAAG CTGCCTGTTC CAAAGGTCCT GCACTTTGAA CGGCATGATG 250 

GCTGGAGCAA TCTGCTCATG AGTGAGGCCG ATGGCGTCCT TTGCTCGGAA 300 

GAGTATGAAG ATGAACAAAG CCCTGAAAAG ATTATCGAGC TGTATGCGGA 350 

GTGCATCAGG CTCTTTCACT CCATCGACAT ATCGGATTGT CCCTATACGA 4 00 

ATAGCTTAGA CAGCCGCTTA GCCGAATTGG ATTACTTACT GAATAACGAT 450 

35 CTGGCCGATG TGGATTGCGA AAAC TGGGAA GAAGACACTC CATTTAAAGA 500 

TCCGCGCGAG CTGTATGATT TTTTAAAGAC GGAAAAGCCC GAAGAGGAAC 550 

TTGTCTTTTC CCACGGCGAC CTGGGAGACA GCAACATCTT TGTGAAAGAT 600 

GGCAAAGTAA GTGGCTTTAT TGATCTTGGG AGAAGCGGCA GGGCGGACAA 650 

GTGGTATGAC ATTGCCTTCT GCGTCCGGTC GATCAGGGAG GATATCGGGG 700 

40 AAGAACAGTA TGTCGAGCTA TTTTTTGACT TACTGGGGAT CAAGCCTGAT 750 

TGGGAGAAAA TAAAATATTA TATTTTACTG GATGAATTGT TTTAG 7 95 



45 2 ) INFORMATION FOR SEQ ID NO: 1402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear • 

(ii) MOLECULE TYPE: DNA 

744 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1402 
TATTCAACAA TTTATCGGAA ACAG 24 



2) INFORMATION FOR SEQ ID NO: 1403 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1403 
20 TCAGAGAGCC AACTCAACAT TTT 23 



25 



2) INFORMATION FOR SEQ ID NO: 1404 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1404 

35 

AAACAGCGTT TTAGAGCCAA ATAA 24 



40 2) INFORMATION FOR SEQ ID NO: 1405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1405 

TTCTCAGAGA GCCAACTCAA CATT 24 
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2) INFORMATION FOR SEQ ID NO: 14 06 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: BM2580 

(C) ACCESSION NUMBER: X07753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1406 

ATGGAATTGC CCAATATTAT TCAACAATTT ATCGGAAACA GCGTTTTAGA 50 

GCCAAATAAA ATTGGTCAGT CGCCATCGGA TGTTTATTCT TTTAATCGAA 100 

ATAATGAAAC TTTTTTTCTT AAGCGATCTA GCACTTTATA TACAGAGACC 150 

AC AT AC AG T G TCTCTCGTGA AGCGAAAATG TTGAGTTGGC TCTCTGAGAA 200 

ATTAAAGGTG CCTGAACTCA TCATGACTTT TCAGGATGAG CAGTTTGAAT 250 

TCATGATCAC TAAAGCGATC AATGCAAAAC CAATTTCAGC GCTTTTTTTA 300 

ACAGACCAAG AATTGCTTGC TATCTATAAG GAGGCACTCA ATCTGTTAAA 350 

TTCAATTGCT ATTATTGATT GTCCATTTAT TTCAAACATT GATCATCGGT 400 

TAAAAGAGTC AAAATTTTTT ATTGATAACC AACTCCTTGA CGATATAGAT 4 50 

CAAGATGATT TTGACACTGA ATTATGGGGA GACCATAAAA CTTACCTAAG 500 

TCTATGGAAT GAGTTAACCG AGACTCGTGT TGAAGAAAGA TTGGTTTTTT 550 

CTCATGGCGA TATCACGGAT AGTAATATTT TTATAGATAA ATTCAATGAA 600 

ATTTATTTTT TAGATCTTGG TCGTGCTGGG TTAGCAGATG AATTTGTAGA 650 

TATATCCTTT GTTGAACGTT GCCTAAGAGA GGATGCATCG GAGGAAACTG 700 

CGAAAATATT TTTAAAGCAT TTAAAAAATG ATAGACCTGA CAAAAGGAAT 7 50 

TATTTTTTAA AACTTGATGA ATTGAATTGA 7 80 



2) INFORMATION FOR SEQ ID NO: 1407 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1407 
CCCTGTAATA GAAAAGCAAG TAGG 



2) INFORMATION FOR SEQ ID NO: 1408 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1408 
TTGTCGTATC CCTCAAATCA CC 



2) INFORMATION FOR SEQ ID NO: 1409 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1409 
TGGGATTACA ATGGCAATCA GCG 



2) INFORMATION FOR SEQ ID NO: 1410 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1410 
GGGGAATAGG TCACAAGATC TGCTT 



2) INFORMATION FOR SEQ ID NO: 1411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: S46063 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1411 

ATGCTTTTAT ATAAAATGTG TGACAATCAA AATTATGGGG TTACTTACAT 50 

GAAGTTTTTA TTGGCATTTT CGCTTTTAAT ACCATCCGTG GTTTTTGCAA 100 

GTAGTTCAAA GTTTCAGCAA GTTGAACAAG ACGTTAAGGC AATTGAAGTT 150 

TCTCTTTCTG CTCGTATAGG TGTTTCCGTT CTTGATACTC AAAAT GGAGA 200 

ATATTGGGAT TACAATGGCA ATCAGCGCTT CCCGTTAACA AGTACTTTTA 250 

AAACAATAGC TTGCGCTAAA TTACTATATG ATGCTGAGCA AGGAAAAGTT 300 

AATCCCAATA GTACAGTCGA GAT T AAG AAA GCAGATCTTG TGACCTATTC 350 

CCCTGTAATA GAAAAGCAAG TAGGGCAGGC AATCACACTC GATGATGCGT 4 00 

GCTTCGCAAC TATGACTACA AGTGATAATA CTGCGGCAAA TATCATCCTA 4 50 

AGTGCTGTAG GTGGCCCCAA AGGCGTTACT GATTTTTTAA GACAAATTGG 500 

GGACAAAGAG ACTCGTCTAG ACCGTATTGA GCCTGATTTA AATGAAGGTA 550 

AGCTCGGTGA TTTGAGGGAT ACGACAACTC CTAAGGCAAT AGCCAGTACT 600 

TTGAATAAAC TTTTATTTGG TTCCGCGCTA TCTGAAATGA ACCAGAAAAA 650 

ATTAGAGTCT TGGATGGTGA ACAATCAAGT CACTGGTAAT TTACTACGTT 700 

CAGTATTGCC GGCGGGATGG AACATTGCGG ATCGCTCAGG TGCTGGCGGA 750 

TTTGGTGCTC GGAGTATTAC AGCAGTTGTG TGGAGTGAGC ATCAAGCCCC 8 00 

AATTATTGTG AGCATCTATC TAGCTCAAAC ACAGGCTTCA ATGGCAGAGC 850 

GAAATGATGC GATTGTTAAA ATTGGTCATT CAATTTTTGA CGTTTATACA 900 

TCACAGTCGC GC 912 



2) INFORMATION FOR SEQ ID NO: 1412 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1412 
GAGAAAACGC TCCAGCAGGG C 



2) INFORMATION FOR SEQ ID NO: 1413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1413 
CATGAGGCTT TCACTGCGGG G 



2) INFORMATION FOR SEQ ID NO: 1414 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1414 
TATCGTTAAT CGCACCATCA C 



2) INFORMATION FOR SEQ ID NO: 1415 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1415 
ATGCAGTAAT GCGGCTTTAT C 



2) INFORMATION FOR SEQ ID NO: 1416 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: HEL-1 

(C) ACCESSION NUMBER: X91840 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1416 

ATGATGAAAA AATCGTTATG CTGCGCTCTG CTGCTGACAG CCTCTTTCTC 50 

CACATTTGCT GCCGCAAAAA CAGAACAACA GATTGCCGAT ATCGTTAATC 100 

10 GCACCATCAC CCCGTTGATG CAGGAGCAGG CTATTCCGGG TATGGCCGTT 150 

GCCGTTATCT ACCAGGGAAA ACCCTATTAT TTCACCTGGG GTAAAGCCGA 200 

TATCGCCAAT AACCACCCAG TCACGCAGCA AACGCTGTTT GAGCTAGGAT 250 

CGGTTAGTAA GACGTTTAAC GGCGTGTTGG GCGGCGATGC TATCGCCCGC 300 

GGCGAAATTA AGCTCAGCGA TCCGGTCACG AAATACTGGC CAGAACTGAC 350 

15 AGGCAAACAG TGGCAGGGTA TCCGCCTGCT GCACTTAGCC ACCTATACGG 4 00 

CAGGCGGCCT ACCGCTGCAG ATCCCCGATG ACGTTAGGGA TAAAGCCGCA 4 50 

TTACTGCATT TTTATCAAAA CTGGCAGCCG CAATGGACTC CGGGCGCTAA 500 

GCGACTTTAC GCTAACTCCA GCATTGGTCT GTTTGGCGCG CTGGCGGTGA 550 

AACCCTCAGG AATGAGTTAC GAAGAGGCAA TGACCAGACG CGTCCTGCAA 600 

2 0 CCATTAAAAC TGGCGCATAC CTGGATTACG GTTCCGCAGA ACGAACAAAA 650 

AGATTATGCC TGGGGCTATC GCGAAGGGAA GCCCGTACAC GTTTCTCCGG 700 

GACAACTTGA CGCCGAAGCC TATGGCGTGA AATCCAGCGT TATTGATATG 750 

GCCCGCTGGG TTCAGGCCAA CATGGATGCC AGCCACGTTC AGGAGAAAAC 800 

GCTCCAGCAG GGCATTGCGC TTGCGCAGTC TCGCTACTGG CGTATTGGCG 850 

25 ATATGTACCA GGGATTAGGC TGGGAGATGC TGAACTGGCC GCTGAAAGCT 900 

GATTCGATCA TCAACGGCAG CGACAGCAAA GTGGCATTGG CAGCGCTTCC 950 

CGCCGTTGAG GTAAACCCGC CCGCCCCCGC AGTGAAAGCC TCATGGGTGC 1000 

ATAAAACGGG CTCCACTGGT GGATTTGGCA GCTACGTAGC CTTCGTTCCA 1050 

GAAAAAAACC TTGGCATCGT GATGCTGGCA AACAAAAGCT ATCCTAACCC 1100 

30 TGTCCGTGTC GAGGCGGCCT GGCGCATTCT TGAAAAGCTG CAATAA 114 6 



2) INFORMATION FOR SEQ ID NO: 1417 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1417 

45 

TGGTTAACTA YAATCCSATT GCGGA 25 



50 2) INFORMATION FOR SEQ ID NO: 1418 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 1418 
ATGCTTTACC CAGCGTCAGA TT 



2) INFORMATION FOR SEQ ID NO: 1419 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1419 
CGATGAATAA GCTGATTTCT CACG 



2) INFORMATION FOR SEQ ID NO: 1420 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1420 
TGCTTTACCC AGCGTCAGAT TACG 



2) INFORMATION FOR SEQ ID NO: 1421 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1421 
AATTAGAGCG GCAGTCGGGA GGAA 24 

5 

2) INFORMATION FOR SEQ ID NO: 1422 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1422 
2 0 GAAAT CAGCT TATTCATCGC CACG 2 4 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1423 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 0' (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: GRI-1 

(C) ACCESSION NUMBER: X92506 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1423 



ATGGTTAAAA AATCACTGCG TC AG T TC AC G CTGATGGCGA CGGCAACCGT 50 

CACGCTGTTG TTAGGAAGTG TGCCGCTGTA TGCGCAAACG GCGGACGTAC 100 

AGCAAAAACT TGCCGAATTA GAGCGGCAGT CGGGAGGAAG ACTGGGTGTG 150 

GCATTGATTA ACACAGCAGA TAATTCGCAA ATACTTTATC GTGCTGATGA 2 00 

45 GCGCTTTGCG ATGTGCAGCA CCAGTAAAGT GATGGCCGTG GCCGCGGTGC 250 

TGAAGAAAAG TGAAAGCGAA CCGAATCTGT TAAATCAGCG AGTTGAGATC 300 

AAAAAATCTG ACTTGGTTAA CTATAATCCG ATTGCGGAAA AGCACGTCGA 350 

TGGGACGATG TCACTGGCTG AGCTTAGCGC GGCCGCGCTA CAGTACAGCG 4 00 

ATAACGTGGC GATGAATAAG CTGATTTCTC ACGTTGGCGG CCCGGCTAGC 4 50 

50 GTCACCGCGT TCGCCCGACA GCTGGGAGAC GAAACGTTCC GTCTCGACCG 500 

TACCGAGCCG ACGTTAAACA CCGCCATTCC GGGCGATCCG CGTGATACCA 550 

CTTCACCTCG GGCAATGGCG CAAACTCTGC GTAATCTGAC GCTGGGTAAA 600 

GCATTGGGTG ACAGCCAACG GGCGCAGCTG GTGACATGGA TGAAAGGCAA 650 
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TACCACCGGT GCAGCGAGCA TTCAGGCTGG ACTGCCTGCT TCCTGGGTTG 700 

TGGGGGATAA AACCGGCAGC GGTGACTATG GCACCACCAA CGATATCGCG 750 

GTGATCTGGC CAAAAGATCG TGCGCCGCTG ATTCTGGTCA CTTACTTCAC 800 

CCAGCCTCAA CCTAAGGCAG AAAGCCGTCG CGATGTATTA GCGTCGGCGG 850 

5 CTAAAATCGT CACCAACGGT TTGTAA 876 



10 



2) INFORMATION FOR SEQ ID NO: 1424 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1424 

20 

GTTAACGGTG ATGGCGACGC TAC 23 



25 2) INFORMATION FOR SEQ ID NO: 1425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1425 

GAATTATCGG CGGTGTTAAT CAGC 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1426 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 6 

CACGCTCAAT ACCGCCATTC CA 22 

753 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1427 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1427 
TTATCGCCCA CTACCCATGA TTTC 24 

2) INFORMATION FOR SEQ ID NO: 1428 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhlmurium 

(B) STRAIN: CAS-5 

30 (C) ACCESSION NUMBER: X92507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1428 

ATGATGACTC AGAGCATTCG CCGCTCAATG TTAACGGTGA TGGCGACGCT 50 

35 ACCCCTGCTA TTTAGCAGCG CAACGCTGCA TGCGCAGGCG AACAGCGTGC 100 

AACAGCAGCT GGAAGCCCTG GAGAAAAGTT CGGGAGGTCG GCTTGGCGTT 150 

GCGCTGATTA ACACCGCCGA TAATTCGCAG ATTCTCTACC GTGCCGATGA 200 

ACGTTTTGCG ATGTGCAGTA CCAGTAAGGT GATGGCGGCC GCGGCGGTGC 250 

TTAAACAGAG CGAGAGCGAT AAGCACCTGC TAAATCAGCG CGTTGAAATC 300 

40 AAGAAGAGCG ACCTGGTTAA CTACAATCCC ATTGCGGAGA AACACGTTAA 350 

CGGCACGATG ACGCTGGCTG AGCTTGGCGC AGCGGCGCTG CAGTATAGCG 400 

ACAATACTGC CATGAATAAG CTGATTGCCC ATCTGGGTGG TCCCGATAAA 450 

GTGACGGCGT TTGCTCGCTC GTTGGGTGAT GAGACCTTCC GTCTGGACAG 500 

AACCGAGCCC ACGCTCAATA CCGCCATTCC AGGCGACCCG CGTGATACCA 550 

45 CCACGCCGCT CGCGATGGCG CAGACCCTGA AAAATCTGAC GCTGGGTAAA 600 

GCGCTGGCGG AAACTCAGCG GGCACAGTTG GTGACGTGGC TTAAGGGCAA 650 

TACTACCGGT AGCGCGAGCA TTCGGGCGGG TCTGCCGAAA TCATGGGTAG 700 

TGGGCGATAA AACCGGCAGC GGAGATTATG GCACCACCAA CGATATCGCG 750 

GTTATCTGGC CGGAAAACCA CGCACCGCTG GTTCTGGTGA CCTACTTTAC 800 

50 CCAACCGGAG CAGAAGGCGG AAAGCCGTCG GGATATTCTG GCTGCGGCGG 850 

CGAAAATCGT AACCCACGGT TTCTGA 87 6 
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2) INFORMATION FOR SEQ ID NO: 1429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1429 
TTTACGGCTA AAGATACTGA AAAGT 25 

2) INFORMATION FOR SEQ ID NO: 1430 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1430 
GTTTAATAAA ACAACCACCG AATAAT 26 

30. 

2) INFORMATION FOR SEQ ID NO: .1431 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1431 
45 TAATTGACAC TCCATTTACG GCTAA 25 



2) INFORMATION FOR SEQ ID NO: 1432 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1432 
ACCGAATAAT ATTTTCCTTT CAGGCA 26 

2) INFORMATION FOR SEQ ID NO: 1433 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 741 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: AJ223604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1433 



ATGAGCAAGT TATCTGTATT CT T TAT AT TT TTGTTTTGCA GCATTGCTAC 50 

CGCAGCAGAG TCTTTGCCAG ATTTAAAAAT TGAAAAGCTT GATGAAGGCG 100 

30 TTTATGTTCA TACTTCGTTT GAAGAAGTTA ACGGGTGGGG CGTTGTTCCT 150 

AAACATGGTT TGGTGGTTCT TGTAAATGCT GAGGCTTACC TAATTGACAC 200 

TCCATTTACG GCTAAAGATA CTGAAAAGTT AGTCACTTGG TTTGTGGAGC 250 

GTGGCTATAA AATAAAAGGC AGCATTTCCT CTCATTTTCA TAGCGACAGC 300 

ACGGGCGGAA TAGAGTGGCT TAATTCTCGA TCTATCCCCA CGTATGCATC 350 

35 TGAATTAACA AATGAACTGC TTAAAAAAGA CGGTAAGGTT CAAGCCACAA 4 00 

ATTCATTTAG CGGAGTTAAC TATTGGCTAG TTAAAAATAA AATTGAAGTT 450 

TTTTATCCAG GCCCGGGACA CACTCCAGAT AACGTAGTGG TTTGGTTGCC 500 

TGAAAGGAAA ATATTATTCG GTGGTTGTTT TATTAAACCG TACGGTTTAG 550 

GCAATTTGGG TGACGCAAAT ATAGAAGCTT GGCCAAAGTC CGCCAAATTA 600 

40 TTAAAGTCCA AATATGGTAA GGCAAAACTG GTTGTTCCAA GTCACAGTGA 650 

AGTTGGAGAC GCATCACTCT TGAAACTTAC ATTAGAGCAG GCGGTTAAAG 700 

GGT T AAACGA AAGTAAAAAA CCATCAAAAC CAAGCAACTA A 741 



45 

2) INFORMATION FOR SEQ ID NO: 1434 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1434 
5 CACAATCAAG ACCAAGATTT GCGAT 25 



10 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1435 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1435 

20 

GAAAGGGCAG CTCGTTACGA TAGAG 25 

25 2) INFORMATION FOR SEQ ID NO: 1436 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1436 

CAGCATCAAC ATTTAAGATC CCCA 24 



2) INFORMATION FOR SEQ ID NO: 1437 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1437 

CTCCACTTGA TTAACTGCGG AAATTC 26 
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2) INFORMATION FOR SEQ ID NO: 1438 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X06046 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1438 

ATGGCAATCC GAATCTTCGC GATACTTTTC TCCATTTTTT CTCTTGCCAC 50 

TTTCGCGCAT GCGCAAGAAG GCACGCTAGA ACGTTCTGAC TGGAGGAAGT 100 

TTTTCAGCGA ATTTCAAGCC AAAGGCACGA TAGTTGTGGC AGACGAACGC 150 

20 CAAGCGGATC GTGCCATGTT GGTTTTTGAT CCTGTGCGAT CGAAGAAACG 200 

CTACTCGCCT GCATCGACAT TCAAGATACC TCATACACTT TTTGCACTTG 250 

ATGCAGGCGC TGTTCGTGAT GAGTTCCAGA TTTTTCGATG GGACGGCGTT 300 

AACAGGGGCT TTGCAGGCCA CAATCAAGAC CAAGATTTGC GATCAGCAAT 350 

GCGGAATTCT ACTGTTTGGG TGTATGAGCT ATTTGCAAAG GAAATTGGTG 400 

25 AT G ACAAAGC TCGGCGCTAT TTGAAGAAAA TCGACTATGG CAACGCCGAT 450 

CCTTCGACAA GTAATGGCGA TTACTGGATA GAAGGCAGCC TTGCAATCTC 500 

GGCGCAGGAG CAAATTGCAT TTCTCAGGAA GCTCTATCGT AACGAGCTGC 550 

CCTTTCGGGT AGAACATCAG CGCTTGGTCA AGGATCTCAT GATTGTGGAA 600 

GCCGGTCGCA ACTGGATACT GCGTGCAAAG ACGGGCTGGG AAGGCCGTAT 650 

30 GGGTTGGTGG GTAGGATGGG TTGAGTGGCC GACTGGCTCC GTATTCTTCG 700 

CACTGAATAT TGATACGCCA AACAGAATGG ATGATCTTTT CAAGAGGGAG 750 

GCAATCGTGC GGGCAATCCT TCGCTCTATT GAAGCGTTAC CGCCCAACCC 800 

GGCAGTCAAC TCGGACGCTG CGCGATAA 828 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1439 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: J03427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1439 

ATGAAAACAT TTGCCGCATA TGTAATTATC GCGTGTCTTT CGAGTACGGC 50 

758 
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ATTAGCTGGT TCAATTACAG AAAATACGTC TTGGAACAAA GAGTTCTCTG 100 

CCGAAGCCGT CAATGGTGTC TTCGTGCTTT GTAAAAGTAG CAGTAAATCC 150 

TGCGCTACCA ATGACTTAGC TCGTGCATCA AAGGAATATC TTCCAGCATC 200 

AACATTTAAG ATCCCCAACG CAATTATCGG CCTAGAAACT GGTGTCATAA 250 

5 AGAATGAGCA TCAGGTTTTC AAATGGGACG GAAAGCCAAG AGCCATGAAG 300 

CAATGGGAAA GAGACTTGAC CTTAAGAGGG GCAATACAAG TTTCAGCTGT 350 

TCCCGTATTT CAACAAATCG CCAGAGAAGT TGGCGAAGTA AGAATGCAGA 4 00 

AATACCTTAA AAAATTTTCC TATGGCAACC AGAATATCAG TGGTGGCATT 4 50 

GACAAATTCT GGTTGGAAGG CCAGCTTAGA ATTTCCGCAG TTAATCAAGT 500 

10 GGAGTTTCTA GAGTCTCTAT ATTTAAATAA ATTGTCAGCA TCTAAAGAAA 550 
ACCAGCTAAT AGTAAAAGAG GCTTTGGTAA CGGAGGCGGC ACCTGAATAT ■ 600 

CTAGTGCATT CAAAAACTGG TTTTTCTGGT GTGGGAACTG AGTCAAATCC 650 

TGGTGTCGCA TGGTGGGTTG GGTGGGTTGA GAAGGAGACA GAGGTTTACT 700 

TTTTCGCCTT TAACATGGAT ATAGACAACG AAAGTAAGTT GCCGCTAAGA 750 

15 AAATCCATTC CCACCAAAAT CATGGAAAGT GAGGGCATCA TTGGTGGCTA 800 

A 801 



20 2) INFORMATION FOR SEQ ID NO: 1440 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1440 

AGACCGTTAT CGTAAACAGG GCTAAG 26 



35 

2) INFORMATION FOR SEQ ID NO: 1441 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1441 
TTTTTTGCTC AAACTTTTTC AGGATC 26 

50 

2) INFORMATION FOR SEQ ID NO: 1442 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: RNL-1 
.(C) ACCESSION NUMBER: Z21957 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1442 



ATGAATGTCA TTATAAAAGC TGTAGTTACT GCCTCGACGC TACT GAT GGT 50 

ATCTTTTAGT TCATTCGAAA CCTCAGCGCA ATCCCCACTG TTAAAAGAGC 100 

AAATTGAATC CATAGTCATT GGAAAAAAAG CCACTGTAGG CGTTGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTACTG ATTAATCCTT TTGAAAAATT 200 

20 CCCAATGCAA AG T GT AT TT A AATTGCATTT AGCTATGTTG GTACTGCATC 250 

AGGTTGATCA GGGAAAGTTG GAT TTAAATC AGACCGTTAT CGTAAACAGG 300 

GCTAAGGTTT TACAGAATAC CTGGGCTCCG ATAATGAAAG CGTATCAGGG 350 

AGACGAGTTT AGTGTTCCAG TGCAGCAACT GCTGCAATAC TCGGTCTCGC 4 00 

ACAGCGATAA CGTGGCCTGT GATTTGTTAT TTGAACTGGT TGGTGGACCA 4 50 

25 GCTGCTTTGC AT G ACT AT AT CCAGTCTATG GGTATAAAGG AGACCGCTGT 500 

GGTCGCAAAT GAAGCGCAGA TGCACGCCGA TGATCAGGTG CAGTATCAAA 550 

ACTGGACCTC GATGAAAGGT GCTGCAGAGA TCCTGAAAAA GTTTGAGCAA 600 

AAAACACAGC TGTCTGAAAC CTCGCAGGCT TTGTTATGGA AGTGGATGGT 650 

CGAAACCACC AC AGG AC C AG AGCGGTTAAA AGGTTTGTTA CCAGCTGGTA 700 

3 0 CTGTGGTCGC ACATAAAACT GGTACTTCGG GTATCAAAGC CGGAAAAACT 750 

GCGGCCACTA AT GATTTAGG TATCATTCTG TTGCCTGATG GACGGCCCTT 800 

GCTGGTTGCT GTTTTTGTGA AAGACTCAGC CGAGTCAAGC CGAACCAATG 850 

AAGCTATCAT TGCGCAGGTT GCTCAGACTG CGTATCAATT TGAATTGAAA 900 

AAGCTTTCTG CCCTAAGCCC AAATTAA 927 

35 



2) INFORMATION FOR SEQ ID NO: 14 4 3 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1443 
50 CTTCTGCTCT GCTGATGCTT GGC 23 
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2) INFORMATION FOR SEQ ID NO: 1444 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 4 4 
GGCGACCAGG TATTTTGTAA TACTGC 26 



2) INFORMATION FOR SEQ ID NO: 1445 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: JMC 

30 (C) ACCESSION NUMBER: X93314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1445 

ATGAATGTCA TCACAAAATG TGTTTTCACC GCTTCTGCTC TGCTGATGCT 50 

35 TGGCTTAAGT TCATTTGTAG TATCAGCCCA ATCCCCTTTG TTAAAAGAGC 100 

AGATTGAAAC CATAGTGACG GGTAAAAAGG CCACTGTAGG TGTAGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTGTTG CTGAATCCAT TTGAAAAGTT 2 00 

TCCGATGCAA AGTGTGTTTA AACTGCATTT AGCTATGTTA GTTCTGCATC 250 

AGGTCGATCA GGGGAAACTG GATTTAAATC AGTCTGTTAC TGTTAATCGT 300 

40 GCTGCAGTAT TACAAAATAC CTGGTCGCCA ATGATGAAAG ATCATCAGGG 350 

CGATGAATTT ACTGTTGCAG TACAGCAGTT ACTGCAGTAT TCGGTGTCAC 4 00 

ACAGCGACAA TGTGGCCTGC GATTTGTTAT TTGAACTGGT GGGCGGGCCG 4 50 

CAAGCTTTGC ATGCTTATAT CCAGTCTTTA GGCGTTAAAG AAGCTGCCGT 500 

GGTAGCAAAT GAAGCGCAAA TGCATGCGGA TGATCAGGTG CAAT AT C AAA 550 

45 ACTGGACGTC GATGAAAGCC GCAGCACAAG TTCTGCAAAA GTTTGAACAG 600 

AAAAAGCAGT TGTCTGAAAC CTCTCAGGCC TTGTTATGGA AATGGATGGT 650 

TGAAACCACC ACAGGACCAC AGCGGTTAAA AGGCTTGTTA CCTGCTGGTA 7 00 

CTATAGTGGC GCATAAAACC GGTACTTCGG GCGTCAGAGC AGGAAAAACT 750 

GCGGCGACTA ATGATGCGGG CGTCATTATG TTGCCTGATG GACGGCCTTT 800 

50 ATTGGTGGCG GTATTTGTCA AGGATTCGGC TGAATCAGAA CGAACCAATG 8 50 

AAGCTATTAT TGCGCAGGTT GCGCAAGCGG CTTATCAGTT TGAGCTGAAA 900 

AAACTCTCTG CAGTGAGTCC GGATTGA 927 
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2) INFORMATION FOR SEQ ID NO: 1446 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1446 
GGCCTGYGAT TTGTTATTTG AACTGGT 



2) INFORMATION FOR SEQ ID NO: 1447 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1447 
CGCTSTGGTC CTGTGGTGGT TTC 



2) INFORMATION FOR SEQ ID NO: 1448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1448 
GATCAGGTGC ARTATCAAAA CTGGAC 



2) INFORMATION FOR SEQ ID NO: 14 4 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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WO 01/23604 PCT/CA00/01 1 50 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1449 
AGCWGGTAAC AAYCCTTTTA ACCGCT 26 



2) INFORMATION FOR SEQ ID NO: 1450 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1450 
ACCACTGGGA ATACACTTGT AATGGC 26 

25. 

2) INFORMATION FOR SEQ ID NO: 1451 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1451 
40 ATCTACCTGG TCAATCATTG CTTCGT 26 



2) INFORMATION FOR SEQ ID NO: 1452 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidls 

(B) STRAIN: BM10393 

(C) ACCESSION NUMBER: AF045472 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1452 

ATGACATTAT CAATAATTGT CGCTCACGAT AAACAAAGAG TCATTGGGTA 50 

CCAAAATCAA TTACCTTGGC ACTTACCAAA TGATTTAAAG CATATTAAAC 100 

10 AACTGACCAC TGGGAATACA CTTGTAATGG CACGGAAAAC TTTTAATTCT 150 

ATAGGGAAGC CATTGCCAAA TAGACGTAAC GTCGTACTCA CTAACCAAGC 200 

TTCATTTCAC CATGAAGGGG TAGATGTTAT AAACTCTCTT GATGAAATTA 250 

AAGAGTTATC TGGTCATGTT TTTATATTTG GAGGACAAAC GTTATACGAA 300 

GCAATGATTG ACCAGGTAGA TGATATGTAT ATCACAGTAA TAGATGGAAA 350 

15 GTTTCAAGGA GACACATTCT TTCCACCATA CACATTCGAA AACTGGGAAG 4 00 

TCGAATCTTC AGTAGAAGGT CAACTAGATG AAAAAAATAC TATACCGCAT 4 50 

ACATTCTTAC ATTTAGTGCG TAGAAAAGGG AAATAG 486 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1453 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1453 
ATCGAAGAAT GGAGTTATCG GRAATG 26 

2) INFORMATION FOR SEQ ID NO: 1454 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1454 
CCTAAAAYTR 'CTGGGGATTT CWGGA 25 

50 



2) INFORMATION FOR SEQ ID NO: 1455 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1455 
CAGGTGGTGG GGAGATATAC AAAA 



2) INFORMATION FOR SEQ ID NO: 1456 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1456 
TATGTTAGAS RCGAAGTCTT GGKTAA 



2) INFORMATION FOR SEQ ID NO: 1457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1457 
CAAAGGTGAA CAGCTCCTGT TT 



2) INFORMATION FOR SEQ ID NO: 1458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1458 
TCCGTTATTT TCTTTAGGTT GGTTAAA 



2) INFORMATION FOR SEQ ID NO: 1459 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1459 
AAGGTGAACA GCTCCTGTTT 



2) INFORMATION FOR SEQ ID NO: 1460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 60 
GATCAC TACG TTCTCATTGT CA 



2) INFORMATION FOR SEQ ID NO: 1461 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AJ238350 



(xi) SEQUENCE DESCRIPTION: SEQ 

GTGAAACTAT CACTAATGGT AGCTATATCG 
TGGCCCTGAT ATTCCATGGA GTGCCAAAGG 
CTATTACCTA TAACCAATGG CTGTTGGTTG 
ATGGGAGCAT TACCCAACCG AAAGTATGCG 
TACATCTGAC AATGAGAACG TAGTGATCTT 
TAACCAACCT AAAGAAAATA ACGGATCATG 
GAGATATACA AAAGCCTGAT CGATCAAGTA 
AATAGACATC GAGCCGGAAG GTGATGTTTA 
ATTTTAGGCC AGTTTTTACC CAAGACTTCG 
TACCAAATCT GGCAAAAGGG TTAA 



ID NO: 1461 

AAGAATGGAG TTATCGGGAA 50 

TGAACAGCTC CTGTTTAAAG 100 

GACGCAAGAC TTTTGAATCA 150 

GTCGTAACAC GTCCAAGTTT 200 

TCCATCAATT AAAGATGCTT 250 

TCATTGTTTC AGGTGGTGGG 300 

GATACACTAC ATATATCTAC 350 

CTTTCCTGAA ATCCCCAGCA 400 

CCTCTAACAT AAATTATAGT 4 50 

474 



2) INFORMATION FOR SEQ ID NO: 1462 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 62 



GCACTCCCYA ATAGGAAATA CGC 



2) INFORMATION FOR SEQ ID NO: 14 63 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 63 
AGTGTTGCTC AAAAACAACT TCG 

2) INFORMATION FOR SEQ ID NO: 1464 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid' 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1464 
10 ACGTTYGAAT CTATGGGMGC ACT 23 



15 



45 



2 ) INFORMATION FOR SEQ ID NO: 1465 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1465 

25 

GTCGATAAGT GGAGCGTAGA GGC 23 

30 2) INFORMATION FOR SEQ ID NO: 1466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1466 

AAGCATTGAC CTACAATCAG TGT 23 



2) INFORMATION FOR SEQ ID NO: 1467 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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